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Eokaryotic cell division genes and their use in diagnosis and treatment of 

proliferative diseases 

5 

In a first aspect, the present invention is related to the significant functional role of several 
C. elegans genes and of their corresponding gene products in cell division and proliferation 
processes that could be identified by means of RNA-mediated interference (RNAi). 

In a second aspect, the invention relates to the identification and isolation of functional 
10 orthologues of said genes and their gene products found in other eukaryotic species, in 
particular man, including all biologically-active derivatives thereof. 

In a third aspect, the present invention includes the use of said genes and gene products 
(including said orthologues) in the development or isolation of anti-proliferative agents for 
instance their use in q)propriate screenmg assays and m methods for diagnosis and 
15 trestment of proliferative diseases. 

In a forth aspect, the invention relates to antibodies to said gene products and their use in 
the development or isolation of anti-proliferative agents and in methods for diagnosis and 
treatment of proliferative diseases. 

In a fifth aspect, the present invention is related to the use of these genes and gene products 
20 for developing structural models or other models for evaluating drug binding and efficacy 
as well as to any other uses which are derived fix)m the new functions described here and 
which will become ^parent firom the disclosure of the present application for any person 
skilled in the art. 

25 Metazoan cell division consists of an extremely complex, highly regulated set of cellular 
processes which must be tightly co-ordinated, perfectly timed, and closely monitored in 
order to ensure the correct delivery of cellular materials to daughter cells. Defects in these 
processes are known to cause a wide range of so-called proliferative diseases, including all 
forms of cancer. Since cell division represents one of the few, if not the only cellular 

30 process that is common to the aetiology of all forms of cancer, its specific inhibition has 
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long been recognised as a preferred site of therapeutic intervention. Although mitotic 
inhibitor drugs are recognised as one of the most promising classes of chemotherapeutic 
agent, screening attempts to find new drug candidates in this class have been undermined 
by the strong inherent tendency of such screens to identify agents that target a single 
5 protein, tubulin. Tubulin polymerises to form microtubules, the primary cytoskeletal 
elements needed for mitotic spindle function and chromosome segregation. Microtubjule 
functions, however, are ubiquitously needed in almost all cell types, whether dividing or 
not, a fact which therefore explains many of the unwanted side effects caused by anti- 
tubulin drugs. 

10 

Perhaps the best known example of a highly successful anti-neoplastic drug that targets 
tubulin is provided by paclitaxel, and its marketed derivative, Taxol, from Bristol Meyers 
Squibb. Its applicability has indeed been seriously limited by difficulties in detennining an 
adequate dosing regimen due to a range of problematic side effects. Taxol treatment has 

IS resulted in anaphylaxis and severe hypersensitivity reactions characterised by dyspnea and 
hypotension requiring treatment, angioedema, and generalised urticaria in 2-4% of patients 
in clinical trials. All Taxol is administered after pretreatmrat with corticosteroids and 
despite pretreatment, &lal reactions have occurred Severe conductance abnormalities 
resulting in life-Hireatening cardiac anfayfhmia occur in less than 1 percent of patients and 

20 must be treated by insertion of a pacemaker. Taxol can cause fetal harm or fetal deatibi in 
pregnant women. Furthermore, administration is commonly accompanied by tachycardia, 
hypotension, flushing, skin reactions and shortness-of-breatfa (mild dypsnea). 

Despite these shortcomings, Taxol has been hailed by many as the most successful new 
25 anti-cancer ther^utic of the last three decades. Clearly, there is good justification for 
attempting to add to the list of mitotic inhibitors used to treat cancer. However, additional 
drugs that target tubulin or interfere with microtubule dynamics may be expected to have 
similar applicability and limitations as Taxol. 
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The task of the present invention therefore is to find new potential target proteins/genes for 
therapeutical drugs other than tubulin that are essential for completion of mitosis. These 
proteins/genes may provide novel targets to screen for new anti-neoplastic or cytotoxic 
anti-cancer agents. 

5 

Unfortunately, until now, the systematic identification of such target proteins/genes using 
genetic screening methods has been difficult in metazoans, and has relied heavily on the 
use of the unicellular yeast. Several major advances in the use of certain metazoan model 
organisms, particularly the nematode worm Caenorhabditis elegans, have now begun to 
10 offer new ways of bridging this gap. 



The above-mentioned task of the invention to find new potential target proteins/gmes for 
thempeutical drugs other than tubulin involved in mitosis processes is solved by a 
screening assay in C. elegans based on 'genomic RNA mediated interference (RNAi)' 

15 combined with a highly probative microscopic assay for documenting the first roimds of 
embryonic cell division (Sulston et aL^ The embryonic cell lineage of the nematode 
Caenorhabditis elegans. Dev. Biol 100, 64-119 (1983); G6nczy et aly Dissection of cell 
division processes in the one cell stage Caenorhabditis elegans embryo by mutational 
analysis. J Cell Biol 144, 927-946 (1999)). With Ms combination of techniques a selected 

20 gene and also a variety of selected genes can be functionally characterized with 
uiq)recedented speed and efficiency. 



The nematode C elegans exhibits an almost entirely translucent body throughout its 
development, thereby oflFering unparalleled microscopic access for exquisitely detailed 

25 cytological documentation, even for the earliest steps of embryogenesis. This important 
feature, along with its short life cycle (3-5 days), its ease of cultivation, and its low 
maintenance costs, has helped make C elegans arguably the best studied of all metazoans. 
Also, sequence data are now available for over 97% of the C. elegans genome (C. elegans 
Sequi^cing Consortium. Genome sequence of the nematode C. elegans: a platform for 

30 investigating biology. Science 282, 2012-2018 (1998)). Thus, C elegans has proven to be 
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an ideal organism for applying the new technique of RNA-mediated interference (RNAi). 
This technique consists in the targeted, sequence-specific inhibition of gene e5q)ression, as 
mediated by the introduction into an adult worm of double-stranded RNA (dsRNA) 
molecules corresponding to portions of the coding sequences of interest (Fire et a/.. Potent 
5 and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. 
Nature 391, 806-811 (1998)). For the vast majority of C. elegans genes tested to date, this 
has been shown to yield a sequence-specific inhibition of the targeted gene's expression, 
accompanied by clearly detectable loss of fimction phenotypes in the treated worm's Fl 
progeny (and even in some cases, in the treated worm itself). 

10 

A large-scale RNAi technique-based screen was performed for 2,232 (that means 96%) of 
the predicted open reading fimies on chromosome m of C. elegans v^ch is described in 
detail in Gonczy et al., "Functional genomic analysis of cell division in C. elegans using 
RNAi of genes on chromosome HI" Nature 408, 331-336 (2000). For the performance of 
15 this large-scale screen double-stranded RNA corresponding to the individual open reading 
fi^es was produced and micro-injected into adult C. elegans hermaphrodites, and the 
resulting embryos were analysed 24 hours later using time-lapse DIC microscopy. 

Besides others, the C. elegans genes H38K22.2 (Genbank/EMBL ID: AL024499, provided 
in SEQ ID NO. 1 - 3), C02F5.1 (Genbank/EMBL ED: L14745; , provided in SEQ ID NO. 4 
20 and 5) and F10E9.8 (GenBank/EMBL ID: L10986; provided in SEQ ID NO. 6 and 7) gave 
rise to a phenotype detectable by the DIC-assay unplymg a functional role of these genes 
in metazoan cell division processes. 

In at least one case ( for H38K22.2) it had also been possible to identify a stnictuially and 
fimctionally homologous gene, a so-called ortfaologous gene, in another species, in 
25 particular Homo sapiens, namely the human orthologue RF42. 

For the mouse orthologue of the RP42 gene it had merely been known that the gene shows 
a strongly developmentally regulated expression, particularly in proliferating neuroblasts 
firom which neocortical neurons originate (Mas et al., "Cloning and expression of a novel 
gene, RP42, mapping to an autism susceptibility locus on 6Q16" Genomics 1; 65 (1), 70- 
30 74 (2000)). The functional role of RP42 in ceU division and proliferation processes that 
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makes it an excellent tool for the development or identification of drugs for diagnosis and/ 
or ther^y of proliferative diseases was not known so far. 

With the essential function of said genes in cell division and proliferation known, these 
5 newly identified target genes and their corresponding gene products, any homologues, 
orthologues and derivatives thereof represent excellent tools for use in the development 
and isolation of a wide range of tiierapeutics including anti-proliferative agents and in the 
development of methods for diagnosis and treatment of proliferative diseases. 



10 Therefore, in a first aspect, the present invention relates to isolated nucleic acid molecules 
encoding a polypeptide fimctionally involved in cell division and proliferation or a 
fi:agment thereof and comprising a nucleic acid sequence selected fix)m the group 
consisting of: 

(a) the nucleic acid sequences presented in SEQ ID NO. 1 to 3, SEQ ID NO. 4 to S, 
15 SEQ ID NO. 6 to 7, SEQ ID NO. 12 and firagments tiiereof and tiieir 

complementary strands, 

(b) nucleic acid sequences encoding polypeptides that exhibit a sequence identity 
witii SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 1 1 or SEQ 
ED NO. 13 of at least 25 % over 100 residues and/or which are detectable in a 

20 computer aided search using the blast sequence analysis programs with an e- 

value of at most 10"^°, 

(c) nucleic acid sequences which are capable of hybridizmg with the nucleic acid 
sequences of (a) or (b) under conditions of medium stringency, 

(d) nucleic acid sequences ^diich are degenerate as a result of the genetic code to 
25 any of the sequences defined in (a), (b) or (c). 
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The above mentioned fragments of the isolated nucleic acid molecules may comprise a at 
least IS nucleotides and preferably at least 20 nucleotides. 

Additionally the above motioned isolated nucleic acid molecules may be smgle or double- 
stranded DNA-molecules as well as smgle- or double-stranded KNA-molecules. 

5 

a): 

The niideic acid sequences of those nucleic add molecules encoding a polypeptide 
functionally involved in cell division and ptoliferation as mentioned in a) are provided in 
the sequence listing 

10 as SEQ ID NO. 1 - 3 (C. elegans genes H38K22.2 (Genbank/BMOBL ID: AL024499)), 

as SEQ ID NO. 4 and 5 (C elegans gene C02F5.1 (Genbank/EMBL ID: L1474S)), 

as SEQ ID NO. 6 and 7 (C elegans gene F10E9.8 (GeoBank/EMBL ID: L10986)) and 

as SEQ ID NO. 12 (the human H38K22.2 orthologue, the RP42 protein (NCBI Accession 
No. AF292100). 

15 The corresponding deduced amino acid sequences of these target genes are disclosed in 
SEQ ID NO. 8 (for H38K22.2a), in SEQ ID NO. 9 (for H38K:22.2b), in SEQ ID NO. 10 
(for C02F5.1), in SEQ ID NO. 11 (for F10E9.8) and in SEQ ID NO. 13 (for RP42). 



b): 

20 Additionally, the present invention also comprises isolated nucleic acid molecules that are 
structurally and functionally homologous counterparts (particularly orthologues) of at least 
one of said target genes as disclosed in SEQ ID NO 1 to 7 or 12. 

Hiose homologous nucleic acid molecules may encode polypqytides that exhibit a 
sequence identity with SEQ BD NO. 8. SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 11 or 
25 SEQ ID NO. 13 of at least 25 % over 100 residues, preferably of at least 30 % over 100 
residues, more preferably of at least 35 % over 100 residues and most preferably at least 40 
% over 100 residues. 
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Fig. 5 shows hat the aforementioned sequ^ce identities are signifcant homologies that are 
appropriate to identify a polypeptide as an orfhologue of the target proteins as depicted in 
SEQ ED NO. 8 -1 1, and 13. Fig. 5 shows a multiple sequence alignment of the H38K22.2a 
family on protem level generated with a BLAST sequence analysis program. In this 

5 alignment the two C. elegans splice variants H38K22.2a and H38K22.b are compared to 
their corresponding orthologues in DrosopMla (CG7427), in mouse (AAF04863) and in 
Homo sapiens (AAH09478). The statistics in Fig 5 for the alignments show that the 
sequence identity on protein level between the C. elegans clone H38K22.2a and its human 
orthologue (AAH09478) is 36 % over 299 residues. Similarly, the sequence identities 

10 between C. elegans clone H38K22.2b (the other splice variant) and its human orthologue is 
36 % over 238 residues. It is obvious to anyone skilled in the art that these sequence 
homologies are significant homologies and that therefore the human clone with the 
accession No. AAH09478 is unambiguously identified as the human orthologue of the C. 
elegans clones H38K22.2a and H38K22.b. 

IS The invention also comprises isolated nucleic acid molecules that are detectable in a 
computer aided search using one of the BLAST sequence analysis programs with an e- 
value of at most 10'^*^, preferably with an e-value of at most most 10*^^ , more preferably 
with an e-value of at most most 10"^. 

Fig. 5 shows that the aforementioned e-values characterize signifcant sequence homologies 
20 that are appropriate to identify a polypeptide as an orthologue of the target proteuis as 
depicted m SEQ ID NO. 8 -1 1, and 13. 

The BLAST sequence analysis programs are programs used for sequence analysis that ace 
publicaliy available and known to anyone skilled in the art When sequence alignments are 
done by a BLAST sequence analysis program, most of those programs calculate so called 
25 "e-values" to characterize the grade of homology between the compared sequences. 
Generally a small e-value characterizes a high sequence identity / homology, whereas 
larger e-values characterizse lower sequence identities / homologies. 

"Homology" means the degree of identity between two known sequences. As stated above, 
homologies, that means sequence identities, may suitably be detennined by means of 
30 computer programs known in the art. The degree of homology required for the sequence 
variant will depend upon the intended use of the sequence. It is well within the capability 
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of a person skilled in the art to effect mutational, inserdonal and deletional mutations 
which are designed to improve the function of the sequence or otherwise provide a 
methodological advantage. 

5 c): 

The present invention fijrther relates to isolated nucleic acid sequences or fragments 
thereof which are capable of hybridizing with the nucleic acid sequences of (a) or (b) under 
conditions of medium/high stringency. 

The grade of sequence identity between a first and a second nucleic acid molecule can also 
10 be characterized by the capability of the first nucleic acid molecule to hybridize under 
certain conditions to a second nucleic acid molecule. 

Suitable experimental conditions for determining whether a given DNA or RNA sequence 
"hybridizes" to a specified polynucleotide or oligonucleotide probe involve presoaking of 
the filter containing the DNA or RNA to examme for hybridization in S x SSC (sodium 

15 chloride/sodium citrate) buffer for 10 minutes, and prehybridization of the filter in a 
solution of S X SSC, 5 x Denhardf s solution, 0,5 % SDS and 100 mg/ml of denaturated 
sonicated salmon sperm DNA (Maniatis et al.,1989), followed by hybridization in tiie same 
solution containing a concentration of 10 ng/ml of a random primed (Feinberg, A.P. and 
Vogelsteui, B. (1983), ^/la/. Biochem. 132:6-13), ^^P-dCTP-labeled (specific activity > 1 x 

20 10^ cpm/jig) probe for 12 hours at approximately 45°C. The filter is then washed twice for 
30 minutes m 2 x SSC, 0,5% SDS at at least 55°C (low stringency), at least 60^C (medium 
stringency), preferably at least 65 °C (mediuin/high stringency), more preferably at least 
70°C (high stringency) or most preferably at least 75°C (very high stringency). Molecules 
to which the probe hybridizes imder the chosen conditions are detected using an x-ray film. 

25 

d): 

The present invention further relates to isolated nucleic acid molecules or Segments 
thereof which are degenerate as a result of the genetic code to any of the sequences defined 
in(a),(b)or(c). 
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The application of automated gene synthesis provides an opportunity for generating 
sequence variants of the naturally occurring genes. It will be appreciated, for example, that 
polynucleotides coding for the same gene products can be generated by substituting 
synonymous codons for those represented in the naturally occurring polynucleotide 
5 sequences as identified herein. Such sequences will be referred to as '"degenerate" to the 
naturally occurring sequences. In addition, polynucleotides coding for synthetic variants of 
the corresponding amino acid sequences can be generated which, for example, will result 
m one or more amino acids substitutions, deletions or additions. Also, nucleic acid 
molecules comprising one or more synthetic nucleotide derivatives (including 

10 morpholinos) which provide said nucleotide sequence with a desired feature, e.g. a reactive 
or detectable group, can be prepared. Synthetic derivatives with desirable properties may 
also be included in the corresponding polypeptides. All such derivatives and fragments of 
the above identified genes and gene products showing at least part of the biological activity 
of the naturally occurring sequences or which are still suitable to be used, for example, as 

15 probes for, e.g. identification of homologous genes or gene products, are included within 
the scope of the present invention. 

Having herein provided the nucleotide sequences of various genes functionally involved in 
cell division and proliferation, it will be appreciated that automated techniques of gene 

20 synthesis and/or amplification may be used to isolate said nucleic acid molecules in vitro. 
Becaxise of the length of some coding sequences, application of automated synthesis may 
require staged gene construction, in which regions of the gene up to about 300 nucleotides 
m length are synthesized individually and then ligated in correct succession for final 
assembly. Individually sythesized gene regions can be amplified prior to assembly, using 

25 polymerase chaui reaction (PGR) technology. The technique of PGR amplification may 
also be used to directly generate all or part of the final genes/nucleic acid molecules. In this 
case, primers are synthesized which will be able to prime the PGR ampUfication of the 
final product, either in one piece or in several pieces that may be ligated together. For this 
purpose, either cDNA or genomic DNA may be used as the template for the PGR 

30 amplification- The cDNA template may be derived from commercially available or self- 
constructed cDNA libraries. 
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in a second aspect, the invention relates to nucleic acid probes comprising a nucleic acid 
sequence as previously characterized under (a) to (d) which may be a polynucleotide or an 
oligonucleotide comprising at least IS nucleotides containing a detectable label. 
These nucleic acid probes may be synthesized by use of DNA synthesizers according to 

5 standard procedures or, preferably for long sequences, by use of PGR technology with a 
selected template sequence and selected primers. In the use of the nucleotide sequences as 
probes, the particular probe may be labeled with any suitable label known to those skilled 
in the art, including radioactive and non-radioactive labels. Typical radioactive labels 
include ^^P, ^^^I, ^^S, or the like. A probe labeled with a radioactive isotope can be 

10 constructed from a DNA template by a conventional nick translation reaction using a 
DNase and DNA polymerase. Non-radioactive labels include, for example, ligands such as 
biotin or thyroxin, or various luminescent or fluorescent compounds. The probe may also 
be labeled at both ends with different types of labels, for example with an isotopic label at 
one end and a biotin label at the other end. The labeled probe and sample can then be 

IS combuied in a hybridization buffer solution and held at an appropriate temperature until 
annealing occurs. 

The invention also includes an assay kit comprising eitibier an isolated nucleic acid 
molecule as defined above or a fragment thereof or a probe as defined above in a suitable 
20 container. 

Duplex formation and stability dq}end on substantial complementarity between the two 
strands of a hybrid and a certain degree of mismatch can be tolerated. Therefore, the 
nucleic acid molecules and probes of the present invention may include mutations (bodi 
single and multiple), deletions, insertions of the above identified sequences, and 
25 combinations ih^o^ as long as said sequence variants still have substantial sequence 
homology to the original sequence which peimits the formation of stable hybrids with the 
target nucleotide sequence of interest 

The above identified nucleic acid molecules and probes coding for polypeptides 
30 functionally involved in cell division and proliferation or a part thereof will have a wide 
range of useful appUcations, including their use for identifying homologous, in particular 
orthologous, genes in the same or different species, their use in screening assays for 
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identification of interacting drugs that inhibit, stimulate or effect cell division or 
proliferation, their use for developing computational models, structural models or other 
models for evaluating drug binding and eflBcacy, and their diagnostic or therapeutic use for 
detection or treatment of diseases associated with anomalous and/or excessive cell division 

5 or proliferation, in particular neoplastic diseases, including both solid tumors and 
hemopoietic cancers, or coronary restenosis. Exemplary neoplastic diseases include 
carcinomas, such as adenocarcinomas and melanomas; mesodermal tumors, such as 
neuroblastomas and retinoblastomas; sarcomas and various leukemias; and lymphomas. Of 
particular interest are tumors of the breast, ovaries, gastrointestinal tract, liver, lung, 

10 thyroid glands, prostrate gland, brain, pancreas, urinary tract, and salivary glands. Still 
more specific, tumors of the breast, ovaries, lung, colon, and lymphomas are contemplated. 



In a third aspect, the present invention relates to the use of the above identified nucleic acid 
15 molecules and probes for diagnostic purposes. This diagnostic use of the above identified 
nucleic acid molecules and probes may include, but is not limited to the quantitative 
detection of the expression of said target genes in biological probes (preferably, but not 
limited to cell extracts, body fluids, etc.), particularly by quantitative hybridization to the 
endogenous nucleic acid molecules comprising the above-chaiacterized nucleic acid 
20 sequences (particularly cDNA, RNA). An annormal and/or excessive expression of said 
target genes involved in cell division may be diagnosed that way. 

In a forth aspect, the present invention relates to the use of the above identified nucleic 
acid molecules, probes or their corresponding polypeptides for therapeutical purposes. 

25 

This therapeutical use of the above identified nucleic acid molecules, probes or their 
corresponding polypeptides may include, but is not limited to the use of said nucleic acid 
molecules and their corresponding polypq)tides for direct or indirect inhibition of the 
expression of said target genes and/or for inhibition of the- function of said target genes. 
30 Particularly gene therapy vectors, e.g. viruses, or naked or encapsulated DNA or RNA (e.g. 
an antisense nucleotide sequence) with the above-identified sequences might be suitable 
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for the introduction into the body of a subject suffering firom a proliferative disease or fix)m 
a disease affecting cell division for therapeutical purposes. 



A particularly preferred therq)eutical use of the above identified nucleic acid molecules or 
5 probes relates to their use in a therapeutical application of the RNAi technique, particularly 
in humans or in human cells. 

Double-stranded RNA oligonucleotides effect silencing of the e^qpression of gene(s) vs^hich 
are highly homologous to either of the RNA strands in the duplex. Recent discoveries 
reveal that this effect, called RNA interference (RNAi), that had been originally discovered 

10 in C. elegans, can also be observed in cells, particularly in human cells. Therefore the 
invention further comprises the use of double-stranded RNA oligonucleotides with the 
above identified nucleotide sequences (as stated in a) to d)), preferably with a length of at 
least IS nucleotides (nt), more preferably with a length of at least 20 nt, for therapeutical 
silencing of the expression of genes involved in cell division or proliferation in cells ot 

IS otiier species, particularly in human cells. This therapeutical use particularly applies to 
cells of an individual that suffers from a disease associated with anormalous and/or 
excessive cell division or proliferation, particularly a coronary restinosis or a neoplastic 
disease selected from tiie group consistmg of lymphoma, lung cancer, colon cancer, 
ovarian cancer and breast cancer. 

20 

In a fiftti aspect, the invention further comprises a nucleic acid construct or a recombinant 
vector having incorporated the nucleic acid molecules as defined in (a) to (d) or a fiagmmt 
thereof. 

"Nucleic acid construct" is defiaed herein as any nucleic acid molecule, either single- or 
25 double-stranded, in which nucleic acid sequences are combined and juxtaposed in a 
manner ^ch will not occur naturally. The vector may be any vector which can be 
conveniently subjected to recombiiiant DNA procedures. The choice of the vector will 
usually depend on the host cell into which it is to be introduced. Hie vector noiay be an 
extrachromosomal entity, the replication of which is independent of chromosomal 
30 replication, e.g. a plasmid. Altmiatively, the vector may be one \^ch, when introduced 
into a host cell, is integrated into the host cell genome and replicated together with the 
chromosome(s) into which it has been integrated. 
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The vector is preferably an expression vector in which the nucleic acid molecule as defined 
in (a) to (d) or a fi-agment thereof is operably linked to heterologous or homologous control 
sequences. The term ''control sequences'" is defined herein to include all components 
5 which are necessary or advantageous for expression of the coding nucleic acid sequence. 
Such control sequences include, but are not limited to, a promoter, a ribosome binding site, 
translation initiation and tennination signals and, optionally, a repressor gene or various 
activator genes. Control sequences are referred to as "homologous" if they are naturally 
linked to the coding nucleic acid sequence of mteiest and referred to as "heterologous" if 
10 this is not the case. The term "operably linked" indicates that the sequences are arranged so 
that they fimction in concert for their int^ded purpose, i.e. e>q)ression of the desired 
protein. 

The promoter may be any DNA sequence ^ch shows transcriptional activity in the host 
IS ceU of choice and may be derived fiom genes encoding proteins either homologous or 
heterologous to the host cell. 

Examples of suitable promoters for directing the transcription m a bacterial host are, e.g., 
the phage Lambda Pr or Pl promoters, the /oc, trp or tac promoters of K coU, the promoter 
20 of &e Bacillus subtilis alkaline protease gene or the Bacillus licheniformis alpha-amylase 
gene. 

Examples of suitable promoters for directing the transcription in mammalian cells are, e.g., 
the SV40 promoter (Subramani et al., MoL Cell Biol 1 (1981), 854-864), the MT-1 
25 (metallothionein gene) promoter (Pahniter et al.. Science 222 (1983), 809-814) or the 
adenovirus 2 nuyor late promoter. 

Examples of suitable promoters for use in insect cells are, e.g., the polyhedrin promoter 
(Vasuvedan et al., Fehs. Lett 311, (1992), 7-11), the Autogrq>ha califomica polyhedrosis 
30 basic protein promoter (EP 397 485), or the baculovirus immediate early gene 1 promoter 
(US 5,155,037, US 5,162,222). 
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Examples of suitable promoters for use in yeast cells include promoters fcom yeast 
glycolytic genes (Hitzeman et al., I Biol. Chem. 255 (1980), 1203-12080; Alber and 
Kawasaki, J. Mol AppL Gen 1 (1982), 419-434) and the ADH2-4c promoter (Russell et 
al.. Nature 304 (1983), 652-654). 

5 

The coding sequence may, if necessary, be operably linked to a suitable temunator, such as 
the human growth hormone terminator (Pahniter et al.. Science 222, 809-814 (1983)), or a 
polyadenylation sequence. Also, to permit secretion of the expressed protein, a signal 
sequence may precede the coding sequence. 

10 

Further, the vector may comprise a DNA sequence enabling the vector to replicate in the 
host cell in question. Examples of such sequences are the origins of repUcation of the 
plasmids pUC19, pACYC177, pUBllO, pE194, pAMBl and pIJ702. Another example of 
such a sequence (when the host cell is a mammalian cell) is the SV40 origin of replication. 
15 When the host cell is a yeast cell, suitable sequences enabling the vector to replicate are the 
yeast plasmid 2^ replication genes REP 1-3 and origin of replication. 

The vector may also comprise a selectable marker, e.g. a gene coding for a product v^ch 
complements a defect in the host cell, such as the gene coding for dihydrofolate reductase 
20 (DHFR) or a gene which confers resistance to a drug, e.g. ampicillin, kanamycin, 
tetracyclin, chloramphenicol, neomycin or hygromycin. 

A number of vectors suitable for expression in prokaryotic or eukaryotic cells are known in 
the art and several of them are commercially available. Some commercially available 
25 mammalian expression vectors which may be suitable include, but are not limited to, 
pMClneo (Stratagene), pXTl (Stratagene), pSG5 (Stratagene), pcDNAI (Invitrogen), 
EBO-pSV2-neo (ATCC 37593), pBPV-l(8-2) (ATCC 371 10), pSV2-dhfr (ATCC 37146). 

In a sixth aspect, the invention comprises host cells into which the nucleic acid construct or 
30 the recombinant vector is introduced. These host cells may be prokaryotic or eukaryotic, 
including, but not limited to, bacteria, fungal cells, including yeast and filamentous fungi, 
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mammalian cells, including, but not limited to, cell lines of human, bovine, porcine, 
monkey and rodent origin, and insect cells including, but not limited to, dxosophila derived 
cell lines. 

5 The selection of an appropriate host cell will be dependent on a number of factors 
recognized by the art. These include, e.g., compatibility with the chosen vector, toxicity of 
the (co)products, ease of recovery of the desired protein or polypeptide, expression 
characteristics, biosafety and costs. 

Examples of suitable prokaryotic cells are gram positive bacteria such as Bacillus subtilis, 
10 Bacillus licheniformis, Bacillus brevis, Streptomyces lividans etc. or gram negative 
bacteria such as £. coli. 

The yeast host cell may be selected from a species of Saccharomyces or 
Schizosaccharomyces, e,g. Saccharomyces cerevisiae. Useful filamentous fungi may be 
selected from a species oiAspergfllus^ e.g. Aspergillus oryzae or Aspergillus niger. 
15 Cell lines derived fix>m mammalian species which may be suitable and which are 
conmiercially avaUable include, but are not limited to, COS-1 (ATCC CRL 1650) COS-7 
(ATCC CRL 1651), CHO-Kl (ATCC CCL 61), 3T3 (ATCCL 92), NIH/3T3 (ATCC CRL 
1658), HeLa (ATCCL 2), and MRC.5 (ATCC CCL 171). 

20 The recombinant vector may be introduced into the host cells according to any one of a 
number of techniques including, but not limited to, transformation, transfection, protoplast 
fusion, and electroporation. 

The recombinant host cells are then cultivated in a suitable nutrient medium under 
25 conditions p^mitting the expression of the protein of interest The medium used to 
cultivate tiie cells may be any conventional medium suitable for growing the host cells, 
such as minimal or complex media containing appropriate supplements. Suitable media ate 
available from conomiercial suppliers or may be prepared according to published recipes 
(e.g. in catalogues of the American Type Culture Collection). 

30 
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Identificatioii of the heterologoiis polypeptide expressing host cell clones may be done by 
several means, mcluding, but not lunited to, immmiological reactivity with specific 
antibodies. 

5 In a seventh aspect, the invention is related to a method for producing a polypeptide 
functionally involved in cell division and proliferation or a fragment thereof in a host cell 
comprising the steps 

(i) transferring the expression vector with an operably linked nucleic acid 
molecule as defined in (a) to (d) into a suitable host cell, and 

10 (ii) cultivating the host cells of step (i) under conditions which will pennit the 

expression of said polypeptide or fragment thereof and 

(iii) optionally, secretion of the expressed polypeptide into the culture medium. 

In an eigth aspect, the invention comprises a polypeptide functionally involved in cell 
15 division and proliferation or a fragment thereof comprising an amino acid sequence 
selected from the groiqp consisting of: 

(a) the amino acid sequences depicted in SEQ ID NO. 8, 9, 10, 11 and 13 and 
fi:^ments thereof, 

(b) amino acid sequences which exhibit a sequence identity with the sequences of 
20 (a) of at least 25 % over 100 residues, preferably of at least 30 % over 100 

residues, more preferably of at least 35 % over 100 residues and most 
preferably of at least 40 % over a 100 residues and/or which are detectable in a 
computer aided search using the BLAST sequence analysis programs with an e- 
value of at most 10'^*^, preferably with an e-value of at most 10"^^ and most 
25 preferably with an e-value of at most 10"^, 

(c) amino add sequences encoded by a nucleic acid molecule that is enable of 
hybridizmg with the nucleic acid sequences of (a) or (b) or encoded by a nucleic 
acid molecule that is degenerate as a result of tiie genetic code to any of the 
sequences as defined in (a) or (b). 
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The heten>logo\is polypeptide may also be a fusion polypeptide in \^ch another 
polypeptide is fused at the N-tenninus or the C-tenninus of the polypeptide of interest or 
fiagment thereof. A fused polypeptide is pioduced by fusing a nucleic acid sequence (or a 
portion thereoQ encoding another polypeptide to a nucleic acid sequence (or a portion 
S thereof) of the present invmtion. Techniques for producing fusion polypeptides are kno^ 
in the art and include ligating the coding sequences so that they are in fi:ame and the 
e^qnression of the fusion polypeptide is under control of the same promotor(s) and 
terminator. 

10 Expression of the polypeptides of interest may also be performed using in vitro produced 
synthetic mRNA. Synthetic mRNA can be efEcienfly translated in various cell-fiee 
systems, including but not limited to, wheat germ extracts and reticulocyte extracts, as well 
as e£Gciently translated in cell based systems including, but not limited to, microinjection 
into frog oocytes, preferably Xeno/Tu; oocytes. 

15 

In a ninth aspect, the invention involves antibodies against the above identified 
polypeptides and against immunogenic fragments thereof The term ^'antibody*' as used 
herein mcludes both polyclonal and monoclonal antibodies, as well as fragments thereof, 
such as Fv, Fab and F(ab)2 fragments that are capable of binding antigen or hapten. The 

20 present invention also contemplates "humanized'^ hybrid antibodies wherein amino acid 
sequences of a non-human donor antibody exhibiting a desired antigen-specifity are 
combined with sequences of a human acceptor antibody. The donor sequences will usually 
include at least the antigen-binding amino acid residues of the donor but may comprise 
other structurally and/or functionally relevant amino acid residues of the donor antibody as 

25 well. Such hybrids can be prepared by several methods well known in the art (see e.g. WO 
89/09622; WO 94/11509; Couto, Hybridoma 13 (1994), 215-219; Presta, Cancer Research 
57 (1997), 4593-4599). The antibodies of the present invention will have a wide range of 
useful applications, including their use for affinity purification of the corresponding 
immunogenic (poly)peptides, their use for the preparation of anti-idiotypic antibodies, as 

30 well as their use as specific binding agents in various assays, e.g. diagnostic or drug- 
screening assays, or in a method for treatment of diseases associated with anomalous 
and/or excessive cell division or proliferation as exemplified above. Specifically, said 
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antibodies or suitable firagments thereof, particularly in humanized form, may be used as 
therapeutic agents in a method for treating cancer and other diseases associated witti 
anomalous and/or excessive cell division or proliferation as exemplified above. Also, 
antibodies may be raised to the most characteristic parts of the above identified 
5 polypeptides and subsequently be used to identify stracturally and/or fimctionally related 
polypeptides from other sources as well as mutations and derivatives of the above 
identified polypeptides. 

To raise antibodies against the polypeptides of the present invention, there may be used as 
an immimogen either the intact polypeptide or an immunogenic fi:agment thereof, produced 
10 in a suitable host cell as described above or by standard peptide synthesis techniques. 

Polyclonal antibodies are raised by unmunizing animals, such as mice, rats, guinea pigs, 
rabbits, goats, sheep, horses etc., with an appropriate concentration of the polypeptide or 
peptide fragment of interest either with or without an immune adjuvant. 

Acceptable immune adjuvants include, but are not limited to, Freund's complete adjuvant, 
15 Freund's incomplete adjuvant, alum-precipitate, water-in-oil-emulsion containing 
Corynebacterium parvum and tRNA. 

In a typical immunization protocol each animal receives between about 0,1 (ig and about 
1000 ^g of the immunogen at multiple sites either subcutaneously (SC), intraperitoneally 
(DP), intradermally or in any combination fliereof in an initial immunizatioiL The animals 

20 may or may not receive booster injections following the initial injection. Iliose animals 
receiving boosts injections are generally given an equal amount of the immunogen. in 
Freund's incomplete adjuvant by the same route at mtervals of about three or four weeks 
until maximal titers are obtained. At about 7-14 days after each booster immunization or 
about weekly after a single immunization, the animals are bled, the serum collected, and 

25 aliquots are stored at about -20^C. 

Monoclonal antibodies which are reactive with the polypeptide or peptide fiagment of 
interest are prepared using basically the technique of Kohler and Milstein, Nature 256: 
49S-497 (1975). First, animals, e.g. Balb/c mice, are immunized usmg a protocol similar to 
30 that desoibed above. Lymphocytes fi:om antibody-positive animals, preferably 
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splenocytes, are obtained by removing spleens from immunized animals by standard 
procedures known in the art Hybridoma cells are produced by mixing the splenocytes with 
an appropriate fusion partner, preferably myeloma cells, under conditions which will allow 
the formation of stable hybridomas. Fusion partners may include, but are not limited to: 
5 mouse myelomas P3/NSl/Ag 4-1; MPC-1 1; S-194 and Sp 2/0. Fused hybridoma cells are 
selected by growth in a selection medium and are screened for antibody production. 
Positive hybridomas may be grown and injected into, e.g., pristane-primed Balb/c mice for 
ascites production. Ascites fluid is collected about 1-2 weeks after cell transfer and the 
monoclonal antibodies are purified by techniques known in the art. Alternatively, in vitro 
10 production of monoclonal antibodies (mAb) is possible by cultivating the hybridomas in a 
suitable medium, e.g. DMEM with fetal calf serum, and recovering the mAb by 
techniques known in the art. 

Recovered antibody can then be coupled covalentiy to a detectable label, such as a 
radiolabel, enzyme label, luminescent label, fluorescent label or the like, using linker 
1 5 technology established for this purpose. 

Antibody titers of ascites or hybridoma culture fluids are determined by various serological 
or immunological assays which include, but are not limited to, precipitation, passive 
agglutination, enzyme-linked immunosorbent antibody (ELISA) technique and 
radioimmunoassay techniques. Similar assays may be used to detect the presence of the 
20 above identified polypeptides or fragments thereof in body fluids or tissue and cell 
extracts. 

Assay kits for performing the various assays mentioned in the present application may 
comprise suitable isolated nucleic acid or amino acid sequences of the above identified 
genes or gene products, labelled or unlabelled, and/or specific ligands (e.g. antibodies) 
25 thereto and auxiliary reagents as s^propriate and known in the art. The assays may be 
liquid phase assays as well as solid phase assays (i.e. with one or more reagents 
unmobilized on a support). 

Unless otherwise specified, the manipulations of nucleic acids and polypeptidesZ-proteins 
can be performed using standard methods of molecular biology and immunology (see, e.g. 
30 Maniatis et al. (1989), Molecular clonmg: A laboratory manual. Cold Spring Harbor Lab., 
Cold Spring Harbor, NY; Ausubel, F.M. et al. (eds.) '*Current protocols in Molecular 
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Biology". John Wiley and Sons, 1995; Tijssen, P., Practice and Theory of Enzyme 
Immunoassays, Elsevier Press, Amsterdam, Oxford, New York, 1985). 

Hie invention further includes an assay kit comprising either the polypeptide as defined 
S above or a fragment fliereof or an antibody against said polypeptides as defined above or 
against immunogenic fi:agments thereof. 

These recombinant polypeptides or firagments thereof as \vell as antibodies against those 
polypeptides or immimogenic fragments thereof will have a wide range of useful 

10 applications, including their use in screening assays for interacting drugs that inhibit, 
stimulate or effect the cell division or proliferation, their use for developing computational 
models, structural models or other models for evaluating drug binding and eflBcacy, and 
tiieir use m a method for diagnosis or treatment of diseases associated with anomalous 
and/or excessive cell division or proliferation, in particular neoplastic diseases, including 

15 both solid tumors and hemopoietic cancers, or coronary restenosis. Exemplary neoplastic 
diseases include carcinomas, such as adenocarcinomas and melanomas; mesodermal 
tumors, such as neuroblastomas and retinoblastomas; sarcomas and various leukemias; and 
lymphomas. Of particular interest are tumors of the breast, ovaries, gastrointestinal tract, 
liver, lung, thyroid glands, prostrate gland, brain, pancreas, urinary tract, and salivary 

20 glands. Still more specific, tumors of the breast, ovaries, limg, colon, and lymphomas are 
contemplated. 

Therefore in a tenth aspect, the present invention ejqplicitiy includes the use of 
polypeptides as defined above or fi-agments thereof or of antibodies against said 
25 polypeptides or immunogenic firagments thereof in a screening assay for interacting drugs 
that inhibit, stimulate or effect the cell division or proliferation. 

Such a screening assay for interacting drugs may particularly comprise, but is not limited 
to the following steps: 

30 

1 . recombinant e3q)ression of said polypeptide or of an appropriate derivative thereof 
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isolation and optionally purification of the recombinantly expressed polypeptide or 
of its derivative, in particular by afiBnity chromatography 

optionally labelling of the chemical compounds that are tested to interact with said 
polypeptide or its derivative and/or labelling of the recombinantly expressed 
polypeptide 

immobilization of the recombinantly expressed polypeptide or of its derivative to a 
solid phase 

binding of a potential interaction partner or a variety thereof to the immobilized 
polypeptide or its derivative 
optionally one or more washing steps 

detection and/or quantification of the interaction, in particular by monitoring the 
amount of label remaioing associated with the solid phase over background levels. 

Step 1 includes the recombinant expression of the above identified polypeptide or of its 

15 derivative from a suitable expression system, in particular fi-om cell-fi:ee translation, 
bacterial expression, or baculusvirus-based expression in insect cells. 
Stq> 2 comprises the isolation and optionally the subsequent purification of said 
recombinantly expressed polypeptides with appropriate biochemical techniques that are 
familiar to a person skilled in the art. 

20 Alternatively, these screening assays may also include the expression of derivatives of the 
above identified polypeptides which comprises the expression of said polypeptides as a 
fiision protein or as a modified protein, in particular as a GST-fiision protein or as a protein 
bearing a so called "tag"-sequence. These '*tags*'-sequences consist of short nucleotide 
sequences that are ligated *in fi:ame' either to the N- or to the C-temiinal end of the coding 

25 region of said target gene. One of the most common tags that are used to label 
recombmantly expressed genes is the poly-Histidine-tag which encodes a homopolypeptide 
consistmg merely of histidines. In this context the term "polypeptide" does not merely 
comprise polypeptides with the nucleic acid sequences of SEQ ID No. 1 bis 7, their 
naturally occuring homologues, preferably orthologues, more preferably human 

30 orOiologues, in particular the RP42 gene (SEQ ID No. 12), but also derivatives of these 
polypeptides, in particular fiision proteins or polypeptides comprising a tag-sequence. 



2. 
3. 
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These polypeptides, particxilarly those labelled by an appropriate tag-sequence (for 
instance a His-tag) or by GST, may be purified by standard afSnity chromatography 
protocols, in particular by using chromatography resins linked to anti-His-tag-antibodies or 
to anti-GST-antibodies which are both commercially available. Alternatively to the use of 
5 anti-tag- or anti-GST-antibodies or other label-specific* antibodies the purification may 
also involve the use of antibodies against said polypeptides. Screening assays that involve 
a purification step of the recombinantly expressed target genes as described above (step 2) 
are preferred embodiments of this aspect of the invention. 

In a third - optional - step the compoimds tested for interaction may be labelled by 
10 incorporation of radioactive isotopes or by reaction with luminescent or fluorescent 
compounds. Alternatively or additionally also the recombinantly expressed polypeptide 
may be labelled. 

In a forth step the recombinantly expressed polypeptide is immobilized to a solid phase, 

particularly (but not limited) to a chromatography resin. The coupling to the solid phase is 
1 5 thereby preferably established by the gen^ation of covalent bonds. 

In a fifth step a candidate chemical compound that might be a potential interaction partner 

of the said recombinant polypeptide or a complex variety thereof (particularly a drug 

library) is brought into contact with the immobilized polypeptide. 

In a sixth - optional - step one or several washing steps may be performed. As a result just 
20 compounds ibst strongly interact with the immobilized polypeptide remain bound to the 

solid (immobilized) phase. 

In step 7 the interaction between the polypeptide and the specific compound is detected, in 
particular by monitoring the amount of label remaining associated with the solid phase 
over background levels. 

25 

Brief Description of the Drawings 

Fig. 1 shows Die microscopy images taken from time-lapse recording of the first two 
rounds of embryonic cell division m wild ^e C elegans. 

Fig. 2 shows DIG microscopy images taken fiom time-lapse recording of the first two 
30 rounds of embryonic cell division in C. elegans Fl progeny from FO parent treated 

with ds RNA "300C3" or "340G12" directed against gene H38K22.2. 
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Fig. 3 shows DIG microscopy images taken from time-lapse recording of the first two 
rounds of embryonic cell division in C. elegans Fl progeny from FO parent treated 
with dsRNA "307C1" directed against gene C02F5.L 

Fig. 4 shows shows DIG microscopy images taken from time-lapse recording of the first 
5 two romids of embryonic ceil division in C. elegans Fl progeny from FO parent 

treated witti ds BNA *^0SA12'' directed against gene F10E9.8. 

Fig S shows a multiple sequence alignment of the H38K22.2a &mily. Herein, the amino 
acid sequences of the two C. elegans splice variants H38K22.2a and H38K22.2b 
are compared to the amino acid sequences of their orthologues in Drosophila 
10 (GG7427), m mouse (AAF04863) and in homo sapiens (AAH09478). 

The "statistics" refer to values that characterize the grade of homology between the 
individual sequences, as the e-value, the sequence identities and the conservatively 
changed residues (positives). 



15 

Description of the sequence protocol: 

SEQ ID NO. 1 shows the unspliced DNA sequence common to both isoforms a and 
b of the C elegans gene H38K222 (3 104 bp). 

SEQ ID NO. 2 shows the spliced DNA sequence of the C elegans gene H38K22.2a 

20 isoform(1011 bp). 

SEQ ID NO. 3 shows the spliced DNA sequence of the C. elegans gene H38K22.2b 

isoform (852 bp). 

SEQ ID NO. 4 shows the unspliced DNA sequence of the C. elegans gene G02F5. 1 
(3308 bp). 

25 SEQ ID NO. Sshows the spliced DNA sequence of flie C. elegans gene C02F5,1 

(3033 bp). 

SEQ ID NO. 6 shows the unspliced DNA sequence of the C. elegans gene F10E9.8 
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SEQIDN0.7 



SEQIDN0.8 



SEQIDN0.9 



SEQIDNO. 10 



10 SEQIDNO. 11 



SEQIDNO. 12 



SEQIDNO. 13 



IS 



(7097 bp). 

shows tiie spliced DNA sequence of the C. elegans gene Fl 0E9.8 
(3624 bp). 

shows the deduced amino acid sequence of the C. elegans gene 
H38K22.2a isoform (336 aa). 

shows tiie deduced amino add sequence of ttie C. elegans gene 
H38K22.2b isofonn (283 aa). 

shows tiiB deduced amino acid sequence of the C. elegans gene 
C02F5.1 (1010 aa). 

shows the deduced amino acid sequence of the C. elegans gene 
F10E9.8(1207aa). 

shows the cDNA sequence of a human orthologue of H38K22.2 
(780 bp). 

shows the deduced amino acid sequence of a human orthologue of 
H38K222(260aa). 



ine following examples illustrate the present invention widiout, however, limiting the 
20 same thereto. 



EXAMPLE 1: Generation of dsRNA molecules for RNAi experiments 

First, oligonucleotide primer pair sequences were selected to amplify portions of the gene 
25 of interest's coding region using standard PGR techniques. Primer pairs were chosen to 
yield PGR products containing at least 500 bases of coding sequence, or a maximum of 
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coding bases for genes smaller than 500 bases. In order to permit the subsequent use of the 
PGR product as a template for in vitro RNA transcription reactions from both DNA 
strands, the T7 polymerase promoter sequence "TAATACGACTCACTATAGG" was 
added to the 5' end of forward primers, and the T3 polymerase promoter sequence 
5 "AATTAACCCTCACTAAAGG" was added to the 5* end of reverse primers. The 
synthesis of oligonucleotide primers was completed by a commercial supplier (Sigma- 
Genosys, UK or MWG-Biotech, Germany). 

PGR reactions were peifoimed in a volume of SO [il, with Taq polymerase using 0.8 ^M 
primers and approximately 0.1 ^g of wild-type (N2 strain) genomic DNA template. The 

10 PGR products were EtOH precipitated, washed with 70% EtOH and resuspended m 7.0 ^1 
TE. 1.0 ^1 of the PGR reaction was pipetted into each of two fiesh tubes for 5 jil 
transcription reactions using T3 and T7 RNA polymerases. The separate T3 and T7 
transcription reactions were performed according to the manufecturer's instructions 
(Ambion, Megascript kit), each diluted to 50 ^1 with RNase-free water and then combmed. 

15 The mixed RNA was purified using RNeasy kits according to the manufacturer's 
instructions (Qiagen), and eluted into a total of 130 nl of RNase-free H2O. 50 ^il of this 
was mixed with 10 |il 6X injection bulBFer (40 mM KPO4 pH 7.5, 6 mM potassium citrate, 
pH 7.5, 4% PEG 6000). The RNA was annealed by heating at 68°C for 10 min, and at 37^G 
for 30 min. Concentration of the final dsRNAs were measured to be in the range of 0.1-0.3 

20 \ig/\il The products of the PGR reaction, of the T3 and T7 transcription reactions, as well' 
as the dsRNA species were run on 1% agarose gels to be examined for quality control 
purposes. Success of double stranding was assessed by scoring shift in gel mobility with 
respect to single stranded RNA, when run on non-denaturing gels. 

25 

EXAMPLE 2: Injections of dsRNA and phenotypic assays 

dsRNAs were injected bilaterally into the syncitial portion of both gonads of wild-type (N2 
strain) young adult hermaphrodites, and the animals mcubated at 20°G for 24 hrs. 
30 Embryos were then dissected out from the injected animals and analyzed by time-lapse 
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diffeiential interference contrast videomicroscopy for potential defects in cell division 

processes, capturing 1 image every S seconds, as previously described (G6nc^ et al.. 
Dissection of cell division processes in the one cell stage Caenorhabditis elegans embryo 
by mutational analysis. J Cell Biol 144, 927-946 (1999)). For each experiment, embryos 
5 fiom at least 3 different injected womis were fihned in this manner, fiom shordy after 
fertilization until the four cell stage. Embiyos from 2 additional injected worms were also 
recotded via still images, thus yielding phenotypic documentation for at least S injected 
worms in each e?q)eriment 

In some cases, embryos exhibited acute sensitivity to osmotic changes, as evidenced by 
10 their loss of structural integrity during the dissection of the injected animals. In order to 
overcome tiiis limitation, injected animals were not dissected, but rather, anaestiietized for 
10 min in M9 medium containing 0.1% tricaine and 0.01% tetramisole, and mounted intact 
on an agarose pad to observe the Fl embryogenesis in utero (Kirby et al., Dev. Biol. 142, 
203-215 (1990)). The resolution achieved by viewing through the body wall does not equal 
IS that achieved by observing dissected embryos, and only limited phenotypic analysis was 
conducted in these cases. 

Three injected animals were also transferred to a fresh plate 24 hrs after iojection of 
dsRNA, and left at 20°C. Two days later, the plate was checked with a stereomicroscope 
(20^0x total magnification) for the presence of Fl larvae (L2*s-L4*s), as well as their 
20 developmental stage. Two days after that, the plate was inspected again for the presence of 
Fl adults, as well as their overall body morphology and the presence of F2 progeny. 



EXAMPLE 3: Characterization of the C elegam gene H38K22.2 

25 

Two dsRNAs, "300C3" and "340G12", were designed and used to specifically silence the 
expression of the C. elegam gene H38K22.2 by RNAi, thereby testing its fimctional 
involvement in the first 2 rounds of embryonic cell division in this metazoan species. The 
dsRNAs were synthesized in vitro from PCR-amplified wild type genomic DNA fi:agments 
30 of the H38K22.2 gene. For the PGR, two sets of primer pairs were used: 
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"TCAATCAGTATGTCGACCC" with "GGAAGAAATTGGGGAAACA" as forwaiii 
and reverse primers, respectively, to generate dsRNA "300C3", and 
"ATCGAGCGCCTCTTCAATC" with "TGGTGTCTCCATTTGCTGA" as forward and 
reverse primers, respectively, to generate dsRNA "340G12". The dsRNAs were purified, 
5 and injected into adult hermaphrodite worms. The phenotypic consequences of the RNAi 
treatment were documented 24 hours later in the Fl progeny of injected worms, using 
time-lapse differential interference contrast (DIG) microscopy. Embryo recordings started 
~20 minutes after fertilisation, while the female pronucleus is completing its meiotic 
divisions, until the 4 cell ste^e, -30 minutes later. 

10 In the Fl progeny of control woidm that were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
exhibit very limited variability, as observed by DIG microscopy. All processes tiiat were 
examined and scored for the possibility of phenotypic deviations are listed and illustmted 
in Figure 1. Briefly, tiie antero-posterior polarity of the embryo is initially determined by 

IS the position of the male pronucleus at the cortex, shordy after entry mto the egg (right 
arrow in Fig. la). This is accompanied by a clear, coordinated flow of yolk granules 
through the central portion of the cytoplasm along the embryo's longitudinal axis towards 
the male pronucleus, and a concomitant series of cortical waves or ruffles progressing 
towards the anterior of the embryo (left side m Fig.l). Shortly thereafter, the male and 

20 female pronuclei undergo highly patterned migrations (right and left arrows respectively, 
in Fig. la,b) resulting in thek meeting within the posterior half of the embryo (Fig. Ic), 
followed by a centration and rotation (Fig. Id) of the pronuclear pair and associated 
centrosomes (arrov^eads in Fig. Ib-d) to set up the future mitotic spindle along the 
embryo's longitudinal axis. After synchronous breakdown of the pronuclear envelopes, the 

25 clearly bipolar mitotic spindle is initially short (Fig. le), but then elongates vMlq 
exhibiting clear lateral "rocking** movements of the posterior pole (Fig. If-h). These 
movements are accompanied by a slight postoior displacement of the posterior spindle 
pole, v^e the anterior spindle pole remams approximately stationary. This then results in 
an asymmetric positioning of the spindle during anaphase and telophase, thereby yielding 

30 an asymmetric placem^t of the cytokinetic furrow (arrowheads in Fig. lij), and 
gen^ating unequally-sized daughter cells: a smaller posterior PI blastomere (right cell in 
Fig. Ik-o), and larger anterior AB blastomere QeA cell in Fig. Ik-n). While the AB nucleus 
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then migrates directly to the center of the AB cell (left arrow in Fig. lk-1), the PI nucleus 
typically migrates further towards the posterior of that cell (right arrow in Fig. lk-1), before 
undergoing a pronounced 90** rotation while re-migrating to the anterior PI cortex with one 
of its duplicated centrosomes leading (arrowheads in Fig. Im). This insures that the PI 
5 blastomere then divides along the embryo's longitudinal axis, perpendicular to that of the' 
AB blastomere (Fig. In, arro^iieads indicate centrosomes). These two divisions occur 
asynchronously, with PI lagging 2-3 minutes behind AB (Fig. 1 n-p). 

In the Fl embryos of worms injected with dsRNAs "300C3" or "340G12", the following 
highly reproducible phenotypes are observed (Fig. 2). First, although the dynamics of 

10 female pronuclear migration appear normal in all cases, its initiation is often somev^t 
delayed. Meeting and ^iposition of the two pronuclei also typically exhibits defects in that 
the female pronucleus gets captured by only one of the two centrosomes associated with 
the male pronucleus (compare Fig. 2a-c with Fig. la-c). Although this defect is usually 
corrected before pronuclear mvelope breakdown is completed, subsequent positioning of 

15 the mitotic spindle within the embryo often appears defective. Weak manifestation of this 
phenotype appears as a lack of rocking of the posterior spindle pole during an^hase, while 
more severe cases show a notable drift of die entire spindle towards the posterior or lateral 
cortex, reaching the cortex itself and losing its longitudinal alignment completely. In the 
latter cases, the strongly aberrant spindle position gives rise to inappropriate specification 

20 of cleavage furrow formation, leading to anomalous cytokinesis. Even in cases where 
spindle position appears relatively normal, positioning of the daughter Nucleus- 
Centrosomes-Complexes (NCCs) typically uppem abnormal as soon as anaphase ends and 
the cleavage furrow ingresses. This is often particularly visible in the AB blastomere, 
where the NCC, instead of moving directly to the centre of the ceU starting at telophase, 

25 first migrates anteriorly in close proximity to the lateral cortex before eventually centering 
(Fig. 2a-k). This defect is usually accompanied by an apparent absence of interzonal 
spindle microtubules at telophase and a notable bifurcation or foridng of die cytokinetic 
cleavage furrow (arrows in Fig. 2 g), leading to aberrandy-sized daughter blastomeres or 
even Mlure of cytokinesis by complete regression of the furrow (Fig. 2g-m). Nuclear 

30 migration and positioning of die PI nucleus is also aberrant in most cases, resulting in a 
significant delay - or in some cases, a complete failure - in achieving its expected 90° 
rotation and association with the anterior cortex. Division of the PI blastomere is often 
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significantly delayed in such embryos. Finally, defects in female meiotic divisions are also 
occasionally observed, as evidenced by the presence of multiple female pronuclei, 
indicating a fidlure to successfidly extrude one or both polar bodies, which could come 
fi'om cytokinetic defects similar to those noted above. 

5 All observed phenotypes indicate a requirement for H38K22.2 gene function in the 
microtubule-dependent cellular positioning of NCCs and spindles during mitosis, and 
possibly meiosis. Since this function is essential to cell cycle progression and cell division 
throughout metazoans, this gene and any homologues and derivatives thereof represent 
excellent tools for use in the development of a wide range of theis^utics including anti- 

10 proliferative agents. Analysis of the H38K22.2 gene sequence reveals clear orthologues in 
human (NCBI Accession # AAH09478), mouse (NCBI Accession # AAF04863) and 
Drosophila (NCBI Accession # CG7427) (see Fig. 5), all of which have had no known 
functions ascribed to them until now. Based on tiieir extremely high level of sequence 
conservation at the protein level, it can be concluded that all of these genes most likely 

15 encode proteins with equivalent functions in each of then: respective species. The 336 
residue protein encoded by tiie H38K22.2 gene isoform "a" exhibits no known structural 
moti& or consensus domains, according to either SMART or CDD analyses. * 



20 EXAMPLE 4: Characterization of flie C elegans gene C02F5.1 

A dsRNA, "307C 1 was designed and used to specifically silence the e}q)ression of the C 
elegans gene C02FS.1 by RNAi, diereby testing its funcdonal involvement in the first 2 
roimds of embryonic cell division in this metazoan species. The dsRNA was synthesized in 

25 vitro fsxm a PCR-amplified wUd type genomic DNA fi:agment of the C02FS.1 gene. For 
the PGR, oligonucleotides with sequences "ATOTGAAGATCCGTCCACT" and 
"ATGCACAATGGGTAl I 1 1 T were used as forward and reverse primers, respectively, 
to generate dsRNA "307Cr which was purified, and injected into adult hemu^hrodite 
worms. The phenotypic consequences of the RNAi treatment were documented 24 hours 

30 later in the Fl progeny of injected worms, using time-l^se differential interference 
contrast (DIC) microscopy. Embryo recordings started ~20 minutes after fertilisation. 
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the female pronucleus is completing its meiotic divisions, until the 4 cell stage, ~30 
minutes later. 

In the Fl progeny of control worms tiiat were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
5 exhibit very limited variability, as observed by DIC microscopy. All processes that were 
examined and scored for the possibility of phenotypic deviations are listed and illustrated 
in Figure 1. 

Fl embryos fbm parent worms injected with dsRNA "307Cr are consistently found to 
10 exhibit the following phenotypes (Fig. 3). First, all cellular processes that are scorable by 
DIC microscopy until entry into mitosis are typically indistinguishable &om the wild type 
pattern. These include egg shape and size, yolk granule size and density, yolk granule 
flows and cortical rufOing, pseudo-cleavage furrow formation and positioning, pronuclear 
£q)pearance (arrows m Fig. 3a) and migration (Fig. 3a,b), as well as centration and rotation 
IS of pronuclei (Fig. 3b,c) and associated pah: of centrosomes (arro^eads in Fig. 3b,c). 
Formation and positioning of the bipolar mitotic spindle also take place normally, but the 
spindle is most often thinner and less rigid than in wild type, exhibiting aberrant lateral 
bending during its rocking and elongation at anaphase (Fig. 3f-i). After completion of 
cytokinesis, vMoh appears normal, the reforming daughter nuclei are typically tear-shaped, 
20 and remam close to the newly-fonned cortex for a prolonged period (Fig. 3a and k). 
Consistent with the tear shq)e, the two nuclei remain often physically connected by 
anomalous chromatin bridges and karyomeres are also typically seen (asterisks in Fig. 3k 
and 1). This phmotype subsequentiy results in embryonic letiiality in all cases. 

The absence of defects in pronuclear migration and assembly of the bipolar spindle argue 
25 against a role for this gene in more general microtubule functions. The observed defects 
are consistent with a failure in mitotic chromosome segregation, most likely in the 
separation of sister chromatids, resulting in the formation of chromatin bridges, which then 
persist at telophase. The present data therefore indicate an essential requirement for 
C02F5.1 gene function in mitotic chromosome segregation. Since this function is essential 
30 to cell cycle progression and cell division throughout metazoans, this gene and any 
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homologues and derivatives thereof represent excellent tools for use in the development of 
a wide range of therapeutics including anti-proliferative agents. 

Analysis of the C02FS.1 sequence reveals that the encoded 1010 residue protein contains 
regions predicted to form coiled coil structures, i.e. likely protem-protein interaction 
S domains. Sequence homology analyses using the BLASTp program presently reveal no 
clearly orthologous sequences in other organisms. However, considering the essential and 
highly conserved nature of the cellular process in question^ functional orthologues of ibis 
gene/protein are esctremely likely to exist in all metazoans, possibly in all eukaryotes, and 
will be identified using for example the methodology as outlined m EXAMPLE 6. 



EXAMPLE 5: Characteraation of flie C elegans gene F10E9.8 

15 

Two dsRNAs, "30SA12" and "34105", were designed and used to specifically silence the 
e^qpression of the C elegans gene F10E9.8 by RNAi, thereby testing its fimctional 
involvemmt in the first 2 rounds of embryonic cell division in this metazoan species. 
The dsRNAs were synthesized in vitro ftom PCR-anq)lified wild type genomic DNA 

20 fiagments of the F10E9.8 gene. For PGR, two sets of primer pairs were used: 
"TTCGTCTCGAACACGTATATCCr with "GAAAGAAGATGAATCAGGCATTG" as 
forward and revise primers, respectively, to generate dsRNA "30SA12", and 
"CTGCAAAAATTATGACTGTGTCG" with "AGCATTCAGATTTGGTTGTCC" as 
forward and reverse primers, respectively, to generate dsRNA "34105". The dsRNA was 

25 purified, and injected into adult hermE^hrodite worms. The phenotypic consequraces of 
the SNAi treatment were documented 24 hours later in the Fl progeny of injected worms, 
using time-lcqpse differCTtial interference contrast (DIG) microscopy. Embryo recordings 
started '^20 minutes after fertilisation, vAnle the female pronucleus is completing its 
meiotic divisions, until the 4 cell stage, ^^30 minutes later. 

30 In the Fl progeny of control worms that were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
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exhibit very limited variability, as observed by DIG microscopy. All processes that were 
examined and scored for the possibility of phenotypic deviations are listed and illustrated in 
Figure 1. 

In the Fl embryos of worms injected wifli dsRNAs "305A12" or "34105", the following 
5 highly reproducible phenotypes are observed (Fig. 4). First, all cellular processes that are 
scorable by DIG microscopy until the 2-cell stage are typically indistinguishable £rom the 
wild type pattern. These include egg shape and size, yolk granule size and density, yolk 
granule flows and cortical ruBling, pseudo-cleavage furrow formation and positioning, 
pronuclear appearance (arrows in Fig. 4a) and migration (Fig. 4a,b), as well as centration 

10 and rotation of pronuclei (Fig. 4b,c) and associated pair of centrosomes (arrowheads in Fig. 
4b,c). The first round of division also occurs without any detectable deviations from wild 
type (Fig. 4d-h). It should particularly be noted that no defects are observed with respect to 
size, number or positioning of centrosomes or spindle poles in the single cell embryo (note 
arrowheads in Fig. 4b-f). In the two-cell stage embryo, however, although nuclear 

15 positioning also remains equivalent to wild type, an apparent failure in centrosome 
duplication is consistently observed in one of the two blastomeres and sometimes in both. 
A single perinuclear centrosomal region, as seen by its exclusion of yolk granules (black 
arrowhead in Fig. 4h-j), is typically observed instead of the two normally seen both in wild 
type embryos and in the unaffected blastomere (white arrowheads in Fig. 4i j). Despite the 

20 parent feilure m centrosome diq)lication, microtubule-dependent processes continue 
normally, as illustrated by the successful anterior migration of tiie PI nucleus, with its 
single centrosomal region leading (black arro\^ead in Fig. 4h-j). Upon entering mitosis, as 
scored by nuclear mvelope breakdown, the defective blastomere then Ms to generate a 
bipolar spindle, forming instead a monopolar array of microtubules (dashed circle in Fig. 

25 4k), as evidenced by the radial alignments of yolk granules in that regiorL Gytokinesis &ils 
to occur in that blastomere, resulting in reformation of multiple, irregularly sized nuclei, 
known as karyomeres (arrows in Fig. 4m,n). In contrast, all aspects of cell division occur 
normally in the neighboring blastomere, resulting in normal daughter cells, each containing 
a single equal-sized nucleus (arrows in Fig. 41). 

30 The complete failure in bipolar spindle formation, accompanied by the presence of a single 
centrosomal region instead of two in the affected two-cell stage blastomere, clearly 
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indicates a requiinement for F10E9.8 gene function in the complex process of mitotic 
spindle assembly. However, the lack of detectable defects in other microtubule-dependent 
processes including pronuclear migration and spindle function in the single-cell embryo 
effectively rules out a general microtubule-related function. In view of the maternal nature 
5 of the RNAi effect and the fact that the egg inherits its first centrosome paternally, the 
successfiil generation of a bipolar spindle in the smgle-cell embryo further suggests that 
F10E9.8 function may, in feet, be required for some aspect of centrosome duplication or 
separation. 

Indeed, smce sperm development is folly completed within the parent before initiation of 
10 the RNAi treatment, it imiains unaffected by the injected dsRNA. This results in the 
donation of an intact "wild type'* centrosome fiom the sperm to the egg at fertilisation. 
After fertilisation, this akeady bipartite centrosome (i.e. contaming two "replication units", 
as evidenced by the presence of two centrioles) undergoes one round of di^lication, as 
observed in other systems by the budding of a new centriole barrel from each existing 
IS centriole. This is followed by a physical separation of the two centriole pairs and 
associated pericentriolar material. This process is not dependent on the prior duplication 
event, and is solely needed to insure the successfiil formation of the bipolar spindle to be 
used in the first round of embryonic cell division. It therefore appears diat F10E9.8 
function is most likely not required for this process. 

20 5. If the first duplication round fails, however, bipolar spindle formation is e3q)ected to fail 
during the second round of division, as seen here. Interestingly, the fact that this failure 
often occurs only in one of the two blastomeres suggests that in these cases only one of the 
original centrosome's two "replication units" actually failed in its first round of diq)lication 
at the single-cell stage. Tnis observation is consistent with findings firom other eukaryotes 

25 indicating that one of the two replication units contained within the sperm's centrosome 
actually comes into the egg aheady fully equipped for one duplication round, while the 
other must rely on cytoplasmic fectors within the egg to permit its own duplication (Sluder, 
G., Hinchcliffe EH. Control of centrosome reproduction: the right number at the right time. 
BioL Cell 9U 413-27 (1999). 
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The present findings therefore suggest that the requirement for F10E9.8 function in mitotic 
spindle assembly most likely results fiom this gene's essential role in the process of 
centrosome duplication. 

5 Since the process of spindle assembly is essential to cell cycle progression and cell division 
throughout metazoans, this gene and any homologues and derivatives thereof represent 
excellent tools for use in the development of a wide range of therapeutics including anti- 
proliferative agents. Analysis of the F10E9.8 sequence reveals that the encoded 1207 
residue protein contains one large region predicted to form coiled coil structures, i.e. likely 

10 protein-protein interaction domains, and four predicted transmembrane domains. Sequence 
homology analyses using the BLASTp program presently reveal no clearly orthologous 
sequences in other organisms. However, considering the essential and highly conserved 
ilature of the cellular process in question, functional orthologues of this gene/protein are 
extremely likely to exist in all metazoans, possibly all eukaryotes, and will be identified 

1 s using for example the following methodology. 



EXAMPLE 6: Protocol for identifying functional orthologues in other species 

20 The present invention describes genes identified as having essential functions in cell 
division in the model organism C. elegans. The basis for performing research in model 
organisms is that the newly discovered functions for the genes in C. elegans will be 
conserved in other species including humans. Cell division is highly conserved during 
evolution and therefore the approach of discovering a gene function in C. elegans and 

25 using the information to characterise or assign functions for the human orthologue is well 
justified. There are two themes of conservation of genes during evolution. A gene 
sequence may be conserved. This means tiiat the DNA nucleotide sequence of the gene is 
very similar in different species, vduch in turn suggests that the function of the gene is the 
same in the different species. As is known to any person skilled in the art, a sequence 

30 identity or homology above a particular level defines that two genes in different species 
code for the same gene product and gene function. Homologous genes are typically 
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identified by perfonning blast analysis with appropriate software, or by other approaches. 
For a blast search, an e-value of 10'^*^ will extract the significant homologous sequences. 
Further phylogenetic analysis can be performed to identify which of the extracted 
sequences are the orthologues. 

S Therefore the following example for identification of orthologues can be presented. A blast 
search is performed using the blast sequence analysis programs and an e-value of 10'^. An 
alternative parameter can be the percoitage of sequence identity. Over 100 residues, a 
sequence identity of 30% defines a homologous gene. After the blast search is completed, 
multiple sequence alignment is performed using q)propriate software (for example, 
10 CLUSTALW) and a neighbour joining phylogenetic tree is generated. Any person skilled 
in the art can identify the human orthologue from a phylogenetic tree. Essentially, the 
human sequence that is separated on the tree by a single speciation event or most closely 
related on the tree is likely to be an orthologue. 

The second theme of conservation is that the gene function can be conserved with greater 
15 divergence of sequence. In the present invention this theme of conservation is not defined. 
However, if other genes are discovered to have functions that result in the gene product 
being identified as the same gene product as those claimed in the present invention then the 
present claims also apply to such genes. 

20 
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Claims 

1 . An isolated nucleic acid molecule encoding a polypeptide functionally iavolved in cell 
division and proliferation or a fragment thereof and comprising a nucleic acid 
sequence selected fiom the group consisting of: 

(a) the nucleic acid sequences presented in SEQ ED NO. 1 to 3, SEQ ID NO. 4 to 5, 
SEQ ID NO. 6 to 7, SEQ ID NO. 12 and fiagments thereof and flieir 
complementary strands, 

(b) nucleic acid sequences encoding polypeptides that exhibit a sequence identity \sdth 
SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 1 1 or SEQ ID NO. 
13 of at least 25 % over 100 residues and/or which are detectable m a computer 
aided search using the blast sequence analysis programs with an e-value of at most 

(c) nucleic acid sequences which are capable of hybridizing with the nucleic acid 
sequmces of (a) or (b) under conditions of medium/high stringency, 

(d) nucleic acid sequences ^lich are degenerate as a result of the genetic code to any 
of the sequences defined in (a), (b) or (c). 

2. A nucleic acid probe comprising a nucleic acid sequence as defined in claim 1 vAAch 
may be a polynucleotide or an oligonucleotide comprising at least IS nucleotides 
containing a detectable label. 

3. A recombinant vector or nucleic acid construct having incorporated therein the 
isolated nucleic acid molecule of claim 1 or a fiagment thereof. 

4. The vector of claim 3 which is an expression vector. 

5. A host cell which has been genetically engineered to incorporate therein the isolated 
nucleic acid molecule of claim 1 or the recombinant vector or nucleic acid construct of 
claim 3. 

6. The host cell of claim S having incorporated therein the expression vector of claim 4. 
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7. An assay kit comprising the isolated nucleic acid molecule or a fragment thereof of 
claim I or the probe of claim 2 in a suitable container. 



5 8. A method for producing a polypeptide functionally involved in cell division and. 
proliferation or a fragment thereof in a host cell comprising the steps 

(a) transferring the e3q}ression vector of claim 4 into a suitable host cell, and 

(b) cultivating the host cells of step (a) under conditions which will permit the 
expression of said polypeptide or fragment thereof and 

10 (c) optionally, secretion of the expressed polypeptide into the culture medium. 



9. Use of a probe as defined m claim 2 to isolate orthologues of genes comprising the 
nucleic acid sequences as disclosed in SEQ ID NO. 1 to 3, SEQ ID NO. 4 to 5, SEQ 
ID NO. 6 to 7, SEQ ID NO. 11. 

15 

10. Use of the isolated nucleic acid molecule or a fragment thereof as defined in claim 1 
for producing a polypeptide fiinctionally involved in cell division and proliferation or 
a fiagment thereof. 

20 11. Use of a nucleic acid molecule or a fragment thereof as defined in claim 1 or of the 
probe of claim 2 in a screening assay for interacting drugs that inhibit, stimulate or 
effect the cell division or proliferation. 



12. Use of a nucleic acid molecule as defined in claun 1 or of the probe of claim 2 in a 
25 method for diagnosis or treatment of diseases associated with anormalous and/or 

excessive cell division or proliferation. 



30 



The use of claim 12 wherein the disease is a coronary restenosis or a neoplastic disease 
selected fix>m the group consisting of lymphoma, lung cancer, colon cancer, ovarian 
canc^ and breast cancer. 
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14. A polypeptide functionally involved in cell division and proliferation or a fragment 
thereof comprising an amino acid sequence selected from the group consistmg of: 

(a) the amino acid sequences depicted in SEQ TD NO. 8, 9, 10, 11 and 13 and 
5 fragments thereof 

(b) amino acid sequences vMch exhibit a sequence identity with the sequences of (a) 
of at least 25 % over 100 residues and/or which are detectable in a computer aided 
search using the BLAST sequence analysis programs witii an e-value of at most 
10-^^ 

10 (c) amino acid sequences encoded by any of the nucleic acid sequences (c) - (d) as 

defined in claim 1. 

15. A fusion protein comprising the polypeptide or fragment thereof of claim 14. 



15 16. An antibody or a fragment thereof capable of specifically binding with the polyp^tide 
of claim 14 or with an immunogenic part thereof. 

17. A humanized antibody capable of specifically binding with the polypeptide of claim 
14 or with an immunogenic part thereof. 

20 

18. An assay kit comyprising the polypeptide as claimed in claim 14, the frision protein as 
claimed in claim IS, or the antibodies as claimed in claims 16 and/or 17 in a suitable 
container. 

19. Use of the polypeptide of claim 14, of the fusion protein of claim 15, or of the 
25 antibodies of claims 16 or 17 in a screening assay for interacting drugs that inhibit, 

stimulate or efTect the cell division or proliferation. 



20. 



The use of a polypeptide or of an antibody as claimed in claim 19 wherein the 
screening assay for interacting drugs comprises the following steps: 
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1 . recombinant expression of said polypeptide in a host ceU 

2. isolation and optionally purification of the lecombinantly expressed 
polypeptide of step 1 

3 . optionally labelling of the drugs that are tested to interact with said polypeptide 
and/or labelling of the recombmantly expressed polypeptide 

4. inunobilization of the recombinantly expressed polypeptide to a solid phase 

5. binding of a potential interaction partner or a variety thereof to the polypq>tide 

6. optionally one or more washing steps 

7. detection and/or quantification of the interaction, in particular by monitoring 
the amount of label remaining associated with the solid phase over background 
levels. 

21. Use of the polypeptide of claim 14, of an amino acid sequence as defined in claim 14 
or of the antibodies of claims 16 or 17 in a method for diagnosis or treatment of 
diseases associated with anomalous and/or excessive cell division or proliferation. 

22. The use of claim 20 wherein the disease is a coronary restenosis or a neoplastic disease 
selected &om the group consisting of lymphoma, lung cancer, colon cancer, ovarian 
cancer and breast cancer. 

22. Use of the nucleic acid sequences as defined in claim 1 or the amino acid sequences as 
defined in claim 14 for developing computational models, structural models or other 
models for evaluating drug binding and efBcacy. 
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FIG. 2 
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FIG. 3 



wo 02/38805 



PCT/EPOl/13034 




FIG. 4 



wo 02/38805 



44 



PCT/EPOl/13034 



Multiple Sequence Alignment of the H38K22.2a family 



CeH38K22.2a 

CeH38K22.2b 

nmCG7427 

MmAAF04863 

HSAAH09478 



CeH38K22.2a 

CeH38K22.2b 

nmCG7427 

MmAAF04863 

H8AAH09478 



CeH38K22.2a 

CeH38K22.2b 

DmCG7427 

M[nAAF04e63 

HSAAH09478 




CeH38K22.2a 

CeH38K22.2b 

DmCG7427 

MinAAF04863 

HSAAH09478 



CeH38K22.2a 

CeH38K22.2b 

nmCG7427 

MmAAF04863 

HSAAH09478 





iFGC^STIMTOSlDpWAQENAAASRLAQNVGASNAKQFKSVWISp 

S(^FKFLDl|cC^SEKHKRAIS 

,Nd8FKFLDI^W™i.EHHKRSIP 



IillskpdiHd; 

lTNIDD] 
ISMIADl 
ITMIADl 




gYCRENLNYPKPGNASNDQQMBTPKIAQ 
CRENLNYPKPGNASNDQQMETPKIAQ 
CQENDHLKEDSSPASGY(KK2SSASSS 

PQIAGTKSTTV 

IFARPQIAGTKSTTV 



CeH38K22.2a 

CeH38K22.2b 

DmCG7427 

MmAAF04863 

HSAAH09478 



KKPGIFYFNSNLQLIEFKLFQYPMLKTIFKITIHTAGTNR 
KKPGIFYFNSNLQLIEFBOiFQYPMLKTIFKITIHTAGTNR 
SQKNISSAYQTSHSTNMNYG 




CeH3.8K222a 



E-value: le-49 
IdentiUes: .101/275 (36%) 
Positives: 158/275 (56%) 




M 

E-value: 7e-49 
Identities: 1007275 (36%) 
Positives: 157/275 (56%) 




E-^alue: '6e-44 
Identities: 104/299 (36%:) 
Positives: 154/299. (S'6ft) 




FIG. 5 
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CE61773US.ST25 
SEQUENCE LISTING 

<110> Cenix Bioscience GmbH 

<120> Eukaryotic cell division genes and their use in diagnosis and trea 
tment of proliferative diseases 

<130> CB617730S 

<150> us 60/246,750 
<151> 2000-11-09 

<160> 26 

<170> Patent In version 3.1 

<210> 1 

<211> 3104 

<212> DNA 

<213> C. elegans 

<400> 1 



atgaatcgac 


tgaagtccga 


tcaaaaaaca 


aaggtttgta 


aacggaaaca 


agacgatgaa 


60 


gtggagatga 


gtgatatgga 


aactgatcac 


aaaaagtgta 


gaaaacaaga 


aaacagtaaa 


120 


tttgtgcgtg 


tgaaaattcc 


attcgtcatc 


cattcccgtt 


tttctctttt 


tcagcattta 


180 


tctcgagcaa 


gttcgagttc 


tctagctcaa 


agcactgttc 


tttctgacat 


ttttcccaag 


240 


aactacgata 


atatcgtgag 


ttgtagcggg 


aatttcgaaa 


aaaaaactaa 


ttttgccaca 


300 


tcttgctgct 


tcgtttgtta 


tttcttgact 


agacaaattc 


tagctcatct 


agaaagctga 


360 


cttttctcaa 


aatcgttgcg 


agacccaaag 


cagaaaaatg 


tatctttttt 


aaatctacgt 


420 


ggaaacgcgc 


tccaatatta 


aatttcgagg 


ttttcccgcc 


aaatacctaa 


cgagacccaa 


480 


ctttggcgag 


cagagcgttt 


tgcccgcgat 


tttcctgcgt 


ctcttcaaac 


aatctaatca 


540 


ctgctgctgg 


tttatgaaat 


atcaattttc 


ctcatttttt 


aaagctgagc 


aatgttttcg 


600 


ctcaatccta 


aaatttttag 


tagttctaat 


tgtgatcaac 


ggtttcccat 


ttccgatcga 


660 


agtcactttt 


taaattctca 


cttttattga 


tttttttcgt 


tttgaaattc 


ctgatttctt 


720 


cctttttagt 


gataagacat 


cagttgctga 


ctgtagagaa 


agtgtgagaa 


actgttagtg 


780 


agagagagaa 


aacagtttga 


gaaaatgaaa 


aatgttttaa 


ataatgatat 


cataattatt 


840 


atttgatacc 


atttccagct 


ccggcagttc 


gtccagtgga 


ctcaggtcac 


ggaagctgtg 


900 
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tctctcaact 


tcctggcaaa 


CE61773US.ST25 
agctaattgg aatatcgaat 


acgcgatgac 


tctgtatttc 


960 


gacaatccta 


atctttttgc 


taaaticaaca 


ccacaacccra 


gcgttgatag 


gtccaatgta 


1020 


cggcaattgc 


tgactttggc 


aactictacaa 


aataatiaalia 


ttctcacaat 


atttttaatt 


1080 


aaaatttagt 


tatatttaga 




aatatttcrat 


ttatctgaaa 


atacatttta 


1140 


tttcagttgg 


aataattgga 


a a a cr t cf e i" f» "t" 


f»a a a i" a a t" i" rr 


tttttgagcg 


cttttttaat 


1200 


tgttccaact 


gaaatcaaag 


o a 1" ^ 1" "t" f a rr 


a^aaarrcaaa 


tttttttaaa 


gtatatcact 


1260 


aagttttaat 


tctaaaaaag 


"t" a 1" 1" rr fi rr 3 rr ;^ 


a r'at'rf"t"oar»a 


ccgactcatt 


ttgttgaatt 


1320 


gccgacaatt 


gcagaattta 


aoi u u caau eel 


Uddd Uddd 


agtaattttt 


gtagatcgag 


1380 


cgcctcttca 


atcagtatgt 


r^fra r^ooa a a n 


rra ^ a a a ^ n 
^d udddy u ty 


gagaaaaacg 


aatgggaccc 


1440 


cacggaatca 


atcgtttgct 


1^ a f rra +■ ft" ^ 


nrrr'f" a ^ na a rf 

v.* Lrd UUdd^ 


ctactgatcg 


ccgggttctt 


1500 


gtgctcgcct 


ggaagtttac 


i" nr*^ r'a rt a r'a 


a"hrr+"rraa^ 
^dd uy c^dd L. 


tctcgttgga 


tgaatgggtg 


1560 


aaaggaatga 


cagctcttca 




y L ULfdddd L> L> 


tgagacaacg 


aatcgattcg 


1620 


attaattcag 


gactggaatc 




a a a rr +• a rta 
ddd^ L.dL*y (jd 


aaaaattaaa 


taactggaat 


1680 


tatcttccaa 


acttatttga 




rrorra a +* rrr^ 
y dd L U 


actttttaag 


aacaaattca 


1740 


cgcaaaacac 


tgtaaattga 


S /^+* ^ Si ^ ^ 2i 

cigx. L a a L. I. gel 


ddddu u u ugd 


tgtaaaatac 


agagaaaaat 


1800 


tacacacttt 


tcctcgagga 




U w u d d d 1^ 


aacacatagc 


tttattgttg 


1860 


gttcacacca 


cggcagtatg 


a^aa^f^aaaa 
d Ltdd L>wdddd 


aaaaaa^1*^a 
dddddd u l Ud 


attgaaaaat 


tgaaattaag 


1920 


atggaggaaa 


atgttatttc 


rf a +* ol" nrra a a 


taa tat 1"i-ai- 


ttttgtgaaa 


attaataaat 


1980 


ataattttca 


gaccgaagga 


aaai"i"'t"'t"aal" 


apcrtt t pt at 


aataattttc 


gattcaaaaa 


2040 


tttgaattat 


cacaattttt 


aaaaacaaaa 


acfcrt t ct a pfT 


atcgtctcat 


atctaatatc 


2100 


ttatcagtta 


cagttccacg 


agctctacct 


atttgccttc 


aactatgcca 


aatccgccgc 


2160 


ttgccgcaat 


ctggatcttg 






gatgttcttt 


tcggacaacg 


2220 


atcaacaatt 


atgactcaat 


ggatcgattt 


tctatgggca 


caggagaacg 


cggcggcgtc 


2280 


tcgcctcgct 


cagaacgtgg 


gcgcttccaa 


tgcgaagcaa 


ttcaaatcgg 


tgtggatctc 


2340 


tcgtgacacg 


tggaatctct 


tctgggactt 


tattcttctg 


agtaagccag 


atttgtcgga 


2400 


ttacgatgat 


gaaggagcat 


ggccagtgct 


tattgatcaa 
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ttcgttgatt 


attgccgtga 


2460 
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aaatctcaat 


tatccaaagc 
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caggaaatgc gtcaaatgat 


cagcaaatgg agacaccaag 


2520 


ttattattag 


gacaaacaat 


tctaaaatcc 


taaaggttcg 


tgtttcccca atttcttcct 


2580 


attttcagag 


ttataaaata 


ttgcctggac 


acaaaatttt 


gcttcaaaac tacggtacca 


2640 


ggtctcggca 


cgacaaatat 


liaalitiaaalia 


caaaaaliaca 


cgcgccttca atgggtactg 


2700 


tagtttcaca 


cttttcaaaa 


cattaatttt 


tctatoacaa 


cagataagct ttaaaaaatc 


2760 


ttgtgaaaaa 


cttcaaaaaa 


tcaaaagttt 


gaaggcgcac 


atattttaac aaaaaatgtt 


2820 


tcgtgccgag 


accggctacc 


gtatttttta 


tgcgaaattt 


cgcgtttgtg taatattttt 


2880 


atattatacc 


gagaaaactc 


gacactttaa 


aggtgtggta 


gcgaattggg attttatttc 


2940 


gaaaaatatc 


ctaaatattc 




a a a rrrrrfa 
acici uo^^^ wci 


aaagaaaccc ggaatttttt 


3000 


attttaattc 


taatttacaa 


ctaatagaat 


tcaaattgtt 


tcagtatccc atgctcaaga 


3060 


ctatcttcaa 


aataacaatt 


cacaccgccg 


gaacaaatcg 


ataa 


3104 


<210> 2 
<211> 1011 
<212> DNA 
<213> Q. elegans 










<400> 2 
atgaatcgac 


tgaagtccga 


i» wCi CI a CI CI o vi» a 


CI Ci ^ ^ C Vi> Vi* ^ ^ w 


agttcgtcca gtggactcag 


60 


gtcacggaag 


ctgtgtctct 




noaa a arff*^ a 

^ V»CI Cl o o ^ ^ L« Cl 


attggaatat cgaatacgcg 


120 


atgactctgt 


atttcgacaa 


i" "t" P a "t" o 1" "t* 


i" "t" t* ff f "h n cr a "t* 


cgacaccaca gccgagcgtt 


180 


gataggtcca 


atatcgagcg 




c a cf "t" a i" n t" cr 


acccaaagga taaagttgga 


240 


gaaaaacgaa 


tgggacccca 


cacraatcaat 


ccrtttoctca 


ctgatcttgg ctatgaagct 


300 


actgatcgcc 


gggttcttgt 


act.cocct.aa 


aaotttacta 


cacagacaca atgtgaattc 


360 


tcgttggatg 


aatgggtgaa 


aggaatgaca 


gctcttcaag 


cggatactgt tcaaaatttg 


420 


agacaacgaa 


tcgattcgat 


taattcagga 


ctggaatcgg 


ataaggcaaa attccacgag 


480 


ctctacctat 


ttgccttcaa 


ctatgccaaa 


tccgccgctt 


gccgcaatct ggatcttgaa 


540 


actgccatct 


gttgctggga 


tgttcttttc 


ggacaacgat 


caacaattat gactcaatgg 


600 


atcgattttc 


tatgggcaca 


ggagaacgcg 


gcggcgtctc 


gcctcgctca gaacgtgggc 


660 


gcttccaatg 


cgaagcaatt 


caaatcggtg 


tggatctctc 
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gtgacacgtg gaatctcttc 


720 
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tgggacttta ttcttctgag 


taagccagat ttgtcggatt 


acgatgatga 


aggagcatgg 


780 


ccagtgctta ttgatcaatt 


cgttgattat tgccgtgaaa 


atctcaatta 


tccaaagcca 


840 


ggaaatgcgt 


caaatgatca 


gcaaatggag acaccaaaaa 


tagcgcaaaa 


gaaacccgga 


900 


attttttatt 


ttaattctaa 


tttacaacta atagaattca 


aattgtttca 


gtatcccatg 


960 


ctcaagacta tcttcaaaat 


aacaa't'tcac accaccaoaa 


caaatcaata 


a 


1011 


<210> 3 
<211> 852 
<212> DNA 
<213> C. elegans 










<400> 3 
atgaatcgac 


tgaagtccga 


tcaaaaaaca aagatcgagc 


gcctcttcaa 


tcagtatgtc 


60 


gacccaaagg 


ataaagttgg 


agaaaaacga atgggacccc 


acggaatcaa 


tcgtttgctc 


120 


actgatcttg 


gctatgaagc 


tactgatcgc cgggttcttg 


tgctcgcctg 


gaagtttact 


180 


gcacagacac 


aatgtgaatt 


ctcgttggat gaatgggtga 


aaggaatgac 


agctcttcaa 


240 


gcggatactg 


ttcaaaattt 


gagacaacga atcgattcga 


ttaattcagg 


actggaatcg 


300 


gataaggcaa 


aattccacga 


gctctaccta tttgccttca 


actatgccaa 


atccgccgct 


360 


tgccgcaatc 


tggatcttga 


aactgccatc tgttgctggg 


atgttctttt 


cggacaacga 


420 


tcaacaatta 


tgactcaatg 


gatcgatttt ctatgggcac 


aggagaacgc 


ggcggcgtct 


480 


cgcctcgctc 


agaacgtggg 


cgcttccaat gcgaagcaat 


tcaaatcggt 


gtggatctct 


540 


cgtgacacgt 


ggaatctctt 


ctgggacttt attcttctga 


gtaagccaga 


tttgtcggat 


600 


tacgatgatg 


aaggagcatg 


gccagtgctt attgatcaat 


tcgttgatta 


ttgccgtgaa 


660 


aatctcaatt 


atccaaagcc 


aggaaatgcg tcaaatgatc 


agcaaatgga 


gacaccaaaa 


720 


atagcgcaaa 


agaaacccgg 


aattttttat tttaattcta 


atttacaact 


aatagaattc 


780 


aaattgtttc 


agtatcccat 


gctcaagact atcttcaaaa 


taacaattca 


caccgccgga 


840 


acaaatcgat 


aa 








852 



<210> 4 
<211> 3308 
<212> DNA 
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<213> C. elegans 
<400> 4 



atgtcgatgg 


agcctcgtaa 


gaagcggaac 


tcgattctca 


aggtgcggca 


agccgtcgaa 


60 


accatcgagg 


aaaccgtcat 


gaacagtggg 


cctagttcca 


caacaactaa 


tcgacgagtc 


120 


agctttcata 


acgtgaagca 


tgtcaagtca 


gttagagtca 


gtgaataatt 


tatcaataaa 


180 


ataattattt 


caggcagtat 


gacagggacc 


atggtaaaat 


tcttgacgcc 


acaccagtta 


240 


aggagaagat 


tactgacact 


attggatcag 


atggtatttt 


gacgtgagtt 


ccatccttta 


300 


acgtgaaata 


atgaatacgt 


aaaaatcttt 


ttaagaccac 


gtggcggaaa 


catggatatt 


360 


tccgaatctc 


cggcctgcac 


gtcctcattt 


caagtgttcg 


gcggtggtaa 


tctcgataaa 


420 


actatggata 


tgtctctcga 


aacaactatc 


aacgagaaca 


acgaaacggc 


gagattgttt 


480 


gaaaccacaa 


gagatccaac 


actattatac 


gaaaagatcg 


tcgaaaccac 


aacaaaagtt 


540 


accgagcgaa 


ttgttagtat 


gccactggat 


gataccttag 


caatgttcaa 


tacaacgaat 


600 


caagaagata 


aggatatgtc 


agttgatcgt 


tcagttcttt 


tcacgattcc 


caaagttccg 


660 


aagcataacg 


ctacaatgaa 


tagaactata 


ccgatggacc 


tcgatgaatc 


aaaagcagcg 


720 


ggcggccagt 


gcgatgaaac 


ggtatgttga 


attaatagaa 


ggaaccaaat 


tatcttaatt 


780 


ttacagatga 


atgtgttcaa 


tttcacaaac 


ttggaagccg 


ctgaaatgga 


tacgagtaaa 


840 


ttagatgaaa 


ataataccat 


gaatgctatc 


cggattccga 


ttaattcaaa 


cgtcatgcct 


900 


gtagacatgg 


acatcactga 


acatcacact 


ttaattgaag 


aaaagaaaaa 


tgatacattc 


960 


gggccaagtc 


aactgatgga 


catttcggcg 


ccacaagttc 


aagttaatga 


tactttggcc 1020 


attttcaaca 


gtccgagaga 


catctgtaat 


aagggtttgg 


gtgttcctca 


gaatctaata 


1080 


aatatcgcct 


cgaacgtcgt 


acctgtggac 


atggacatca 


ctgatcaggc 


cgtattaaac 


1140 


gcggagaaga 


aaaatgatca 


attcgagaca 


agtcagctta 


tggacatttc 


tattccgaaa 


1200 


gttctagtaa 


atgacactat 


ggcgatgttc 


aacagcccga 


aacacgtcag 


taagagcagc 


1260 


atggatctcg 


agaaaacgat 


tgaagccgct 


gacaaatcaa 


cgaaataccc 


gagtatcgca 


1320 


gatgaggtgg 


aagatttaga 


catggatatg 


gatatcactg 


aacaacaacc 


atgtgaggct 


1380 


ggtaatcagc 


agaacgacgg 


cttgcaactt 


caaaaggagg 


atttaatgga 


catttcggtg 


1440 


attcgagatt 


cacctgcagt 


aaacgacacc 


atggctgtgt 
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tccagagtcc 


tgccagagta 


1500 



wo 02/38805 



PCT/EPOl/13034 



CE61773US.ST25 



aagatcggag 


cggtaagttt 


taagcacact 


ttccaataaa 


aatgtatttc 


tttcagaaca 


1560 


actcgatcat 


tgattcgcag 


aaatctatcg 


tgttcggtga 


cgaaatgagc 


attgacgaga 


1620 


cacaaaatga 


tggaaccttg 


acgttgccaa 


agtcgaatgt 


agaagtgact 


acaactaatg 


1680 


atgtctacac 


gtctctcgag 


cggcaagagg 


aaaatgcttc 


agaaaacgta 


tccatgataa 


1740 


acgaatcttc 


tgttcattcg 


gaaatcgaca 


aaaagtcgtt 


tatgctcatc 


gaagaagaaa 


1800 


gggcttttat 


gcactcctcc 


atgattgatg 


tagcacaaaa 


gttggaagac 


gatggttcgt 


1860 


cgaagacgcc 


agtcatcctt 


gcttcacagt 


cagcttctct 


tgccactaaa 


gaaccatcag 


1920 


cccttcacaa 


ctcgagtgca 


actctcaaca 


attcgatgga 


attggacaac 


aatactcttc 


1980 


ttaaaactat 


gcaaattaca 


acgtgtgaag 


acattagcat 


ggtccatgag 


tctattgctg 


2040 


ttgaactgaa 


cagtaacaaa 


gagcaggagc 


aattcggaga 


tgagactttg 


cagaaaaatg 


2100 


gtaaatttcg 


tttattcaat 


aactctatta 


aaagtatgtt 


ttagatacct 


cgaatactgg 


2160 


cgcgaatttc 


acattccaag 


gccataatga 


aacatcgcaa 


atcatgaaca 


atgtcgactc 


2220 


ggaagcagtg 


aacacgtcca 


agatttcaac 


atattcggct 


ttcaatttga 


gcatcaacca 


2280 


gtctatctct 


aaacgacgtc 


gatctcttct 


gaattctgct 


cgtgaatctc 


ctcgtcgtgt 


2340 


tgcgttggag 


aattctataa 


tgtcgatgaa 


tgggcaaaca 


atggaagctc 


tgacagaata 


2400 


tcgacagaat 


aaaactatgc 


agacgagtca 


agattcgatg 


ccgagtatga 


gtttgaacga 


2460 


ttcgggaaga 


gatattctcg 


cgatggtaag 


aatatctctt 


tgagtattga 


atcgaaaatg 


2520 


tctttcagaa 


tacatcagtc 


cgctctcctc 


atctgaattc 


ttcaaaaact 


gctgccccag 


2580 


gaacaccatc 


attgatgtca 


caaaatgtac 


aacttccacc 


tccatctcct 


caattcgaaa 


2640 


tgccagactt 


cgatccagct 


gtggtcaacg 


ttgtatattt 


aacatctgaa 


gatccgtcca 


2700 


ctgaacaaca 


tccagaagct 


ctcaaatttc 


agcgtattgt 


tgaaaacgag 


aaaatgaaag 


2760 


tacaacacga 


gattgattct 


ctgaattcaa 


ccaatcaact 


ttctgctgag 


aaaattgata 


2820 


tgttgaagac 


taaggagctc 


ttgaagttta 


gtcatgatga 


gcgagaagcg 


attatgattg 


2880 


caagaaaaga 


cgcggaaatc 


aagtttttgg 


agcttcgtct 


gaaatttgca 


ctcgagaaaa 


2940 


aaattgaaag 


tgaccaggaa 


attgctgaac 


tagaacaagg 


aaattcgaaa 


atggctgagc 


3000 


agctaagagg 


tctcgataag 


atggctgtcg 


ttcaaaaaga 
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actagaaaag 


ctgagaagtc 


3060 



wo 02/38805 



PCT/EPOl/13034 



CE61773US.ST25 



ttcctccatc 


acgcgaagag 


agcgggaaaa 


tccgaaagga 


gtggatggag 


atgaagcaat 


3120 


gggaattcga ccagaaaatg 


aaagcactcc 


gaaatgtacg 


ctcaaacatg 


attgcacttc 3180 


gttcagagaa aaatgctctc 


gaaatgaaag 


tcgcggaaga 


acacgagaag 


tttgcccaga 


3240 


ggaacgattt gaagaaaagt 


cgaatgctgg 


tgttctctaa 


ggctgttaag 


aaaatlig'tga 


3300 


acttctag 












3308 


<210> 5 
<211> 3033 
<212> DNA 
<213> C. elegans 












<4UU> D 

atgtcgatgg 


agcctcgtaa 


gaagcggaac 


tcgattctca 


aggtgcggca 


agccgtcgaa 


60 


accatcgagg 


aaaccgtcat 


gaacagtggg 


cctagttcca 


caacaactaa 


tcgacgagtc 


120 


agctttcata 


acgtgaagca 


tgtcaagcag 


tatgacaggg 


accatggtaa 


aattcttgac 


180 


gccacaccag 


ttaaggagaa 


gattactgac 


actattggat 


cagatggtat 


tttgacacca 


240 


cgtggcggaa 


acatggatat 


ttccgaatct 


ccggcctgca 


cgtcctcatt 


tcaagtgttc 


300 


ggcggtggta 


atctcgataa 


aactatggat 


atgtctctcg 


aaacaactat 


caacgagaac 


360 


aacgaaacgg 


cgagattgtt 


tgaaaccaca 


agagatccaa 


cactattata 


cgaaaagatc 


420 


gtcgaaacca 


caacaaaagt 


taccgagcga 


attgttagta 


tgccactgga 


tgatacctta 


480 


gcaatgttca 


atacaacgaa 


tcaagaagat 


aaggatatgt 


cagttgatcg 


ttcagttctt 


540 


ttcacgattc 


ccaaagttcc 


gaagcataac 


gctacaatga 


atagaactat 


accgatggac 


600 


ctcgatgaat 


caaaagcagc 


gggcggccag 


tgcgatgaaa 


cgatgaatgt 


gttcaatttc 


660 


acaaacttgg 


aagccgctga 


aatggatacg 


agtaaattag 


atgaaaataa 


taccatgaat 


720 


gctatccgga 


ttccgattaa 


ttcaaacgtc 


atgcctgtag 


acatggacat 


cactgaacat 


780 


cacactttaa 


ttgaagaaaa 


gaaaaatgat 


acattcgggc 


caagtcaact 


gatggacatt 


840 


tcggcgccac 


aagttcaagt 


taatgatact 


ttggccattt 


tcaacagtcc 


gagagacatc 


900 


tgtaataagg 


gtttgggtgt 


tcctcagaat 


ctaataaata 


tcgcctcgaa 


cgtcgtacct 


960 


gtggacatgg 


acatcactga 


tcaggccgta 


ttaaacgcgg 


agaagaaaaa 


tgatcaattc 


1020 
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gagacaagtc 


agcttatgga 


CE61773US.ST25 
catttctatt ccgaaagttc 


tagtaaatga 


cactatggcg 


1080 


atgttcaaca 


gcccgaaaca 


cgtcagtaag 


aQcacrcatQQ 


atctcgagaa 


aacgattgaa 


1140 


gccgctgaca 


aatcaacgaa 


atacccgagt 


atcgcagatg 


aggtggaaga 


tttagacatg 


1200 


gatatggata 


tcactgaaca 


acaaccatgt 


aaaactaata 


atcagcagaa 


cgacggcttg 


1260 


caacttcaaa 


aggaggattt 


aa'tggacat't 


tcggtgattc 


gagattcacc 


tgcagtaaac 


1320 


gacaccatgg 


ctgtgttcca 


aacrticcti acc 


a cr a crii a a acr a 


tcggagcgaa 


caactcgatc 


1380 


attgattcgc 


agaaatctat 


p ri t" CT 1" "t" p cr cr i" 


era poa a a 1*03 


gcattgacga 


gacacaaaat 


1440 


gatggaacct 


tgacgttgcc 


A a a cri" crra a "t" 


crl" a na a oi" na 

WrCiy ClClVj wV^O 


ctacaactaa 


tgatgtctac 


1500 


acgtctctcg 


agcggcaaga 




pa rra a a a pfr 

i> ^ a y a a ci a wy 


tatccatgat 


aaacgaatct 


1560 


tctgttcatt 


cggaaatcga 


a a a a ct^ r*n 


^' ^ i" a irf p^ pa 


tcgaagaaga 


aagggctttt 


1620 


atgcactcct 


ccatgattga 




a a rr^ ^ rrrra a rr 


acgatggttc 


gtcgaagacg 


1680 


ccagtcatcc 


ttgcttcaca 


n1" pa fTP"t" i* pf" 


pi"t' fipp a pi" a 


aagaaccatc 


agcccttcac 


1740 


aactcgagtg 


caactctcaa 




aaattcraaca 


acaatactct 


tcttaaaact 


1800 


atgcaaatta 


caacgtgtga 


aaacattaac 


atacrtccata 


agtctattgc 


tgttgaactg 


1860 


aacagtaaca 


aagagcagga 




aatcracractt 


tgcagaaaaa 


tgatacctcg 


1920 


aatactggcg 


cgaatttcac 


a't ti ccaaaac 


catiaatiQaaa 


catcgcaaat 


catgaacaat 


1980 


gtcgactcgg 


aagcagtgaa 


caccrtccaaa 


atiiiticaacati 


attcggcttt 


caatttgagc 


2040 


atcaaccagt 


ctatctctaa 


accraccrtccra 


tctcttctcra 


attctgctcg 


tgaatctcct 


2100 


cgtcgtgttg 


cgttggagaa 


ttctataata 


ticaaticraalici 


ggcaaacaat 


ggaagctctg 


2160 


acagaatatc 


gacagaataa 


aactaliacaa 


acaacT'tcaaa 


attcgatgcc 


gagtatgagt 


2220 


ttgaacgatt 


cgggaagaga 


tattctcgcg 


atgaatacat 


cagtccgctc 


tcctcatctg 


2280 


aattcttcaa 


aaactgctgc 


cccaggaaca 


ccatcattga 


tgtcacaaaa 


tgtacaactt 


2340 


ccacctccat 


ctcctcaatt 


cgaaatgcca 


gacttcgatc 


cagctgtggt 


caacgttgta 


2400 


tatttaacat 


ctgaagatcc 


gtccactgaa 


caacatccag 


aagctctcaa 


atttcagcgt 


2460 


attgttgaaa 


acgagaaaat 


gaaagtacaa 


cacgagattg 


attctctgaa 


ttcaaccaat 


2520 


caactttctg 


ctgagaaaat 


tgatatgttg 


aagactaagg 


agctcttgaa 


gtttagtcat 


2580 
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gatgagcgag aagcgattat 


gattgcaaga 


aaagacgcgg aaatcaagtt tttggagctt 


2640 


cgtctgaaat ttgcactcga 


gaaaaaaatt 


gaaagtgacc aggaaattgc tgaactagaa 2700 


caaggaaatt cgaaaatggc 


tgagcagcta 


agaggtctcg ataagatggc tgtcgttcaa 


2760 


aaagaactag aaaagctgag 


aagtcttcct 


ccatcacgcg aagagagcgg gaaaatccga 


2820 


aaggagtgga tggagatgaa 


gcaatgggaa 


ttcgaccaga aaatgaaagc actccgaaat 


2880 


gtacgctcaa acatgattgc 


acttcgttca 


gagaaaaatg ctctcgaaat gaaagtcgcg 2940 


gaagaacacg agaagtttgc 


ccagaggaac 


gatttgaaga aaagtcgaat gctggtgttc 


3000 


tctaaggctg ttaagaaaat 


tgtgaacttc 


tag 


3033 


<210> 6 
<211> 7097 
<212> DNA 
<213> C. elegans 








accgcatctc 


ttccaatgga 


tcaaccatca 






cccgcacctt 


ccgttgctga 


agagcatggc 


Odwdy uy y dy Wrfdwy L« L>y d dydd^ddtj^dd 




gacaatgaca 


cggatgaagt 


atctgcaatg 


u u L. L. uy uy v«v.« uy d u^d dwc? L* dwu 


1 80 


cttgttaatt 


cagatcatga 


attgtctgat 


^dU^L^UwUdd d^udUddddd uy s.*d^v^L>y v.*v^ 


240 


gaattcaaag 


cttttgagag 


aagaatggat 


tcggtaagaa cagccaaatc agaatgataa 


300 


ttgaaatttt 


acatagaata 


gatttacgta 


tcaaaaatca aaacctacga atactctcta 


360 


attcaaaatt 


taattaatta 


aaattaaaga 


tgagatcagc ttcaacaatc acaacatcac 


420 


tggcaacgcc 


atcatcttgt 


gcaccatcaa 


actcctctga gcctcctact cggtctacac 


480 


caati iLalzgaa 


cgatttaggc 


gttggcccaa 


ataatcacaa ttggccgtct tcaatgcaag 


540 


aattatcagg 


aatttctctg 


gaaacaccac 


aggctcgacc gcttggcagc aatagaatta 


600 


atcagcttgg 


taggttaata 


acaaaaaaaa 


catgattgat tagattttta gttcgaagtg 


660 


aggctcaaac 


gggaataagc 


cttttacaac 


accatgaaag acctactgtg accgccccat 


720 


tgagacgaaa 


tgatatgatg 


aactcatcac 


gacagaatcc acagaatgga aatgttcaag 


780 


atgaaaatcg 


acccgagcac 


gtttatgatc 


aaccaataca tgttcctgga tcatcactgg 


840 


accgacagaa 


acttgaaatt 


gaaattcgac 


gtcatcgtaa cttgaacata caactgagag 
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aactctgccc 


gaaccgtgga 


cagtggaata 


tcaaatgaag 


acgagacccg 


tccaccaacc 


5640 


aatagctaac 


ggtcttcttt 


tcgaatattc 


caatggagat 


cttcgatggg 


ttaatcggca 


5700 


gaacgctgtt 


aatgtaagtt 


ttaattggaa 


ttttgtcaat 


taaagtgacc 


aatttacaga 


5760 


tctacatatc 


cgcagttgat 


aaaacagtca 


gaattgatct 


ccccacatac 


aatatttcaa 


5820 


ttattcatac 


atttcaaagg 


caagttgaag 


tacttcgtcc 


tggaaataac 


ataacattga 


5880 


taagtattaa 


acgacgagaa 


gttcgaactg 


atttgattta 


tcaaaacgga 


atgtataaaa 


5940 


ctgaaatgta 


agattatttt 


cttttttaaa 


gttcatcgga 


aatttcgtat 


ttcagcttca 


6000 


atagggacgg 


aagatatgtt 


acgaaggatt 


ttagcaatca 


agaagtttcg 


agaaagtgag 


6060 


tctctattca 


ttttccaatt 


aattattcag 


aaaaaccatt 


aaaatctcaa 


aactattacc 


6120 


cagatgttta 


atttaattta 


atttaattta 


ttacgataga 


agatatcgtt 


aggtagaaaa 


6180 


aaaaacacac 


acacattaat 


agatacaaac 


catcacaagt 


ggttacataa 


ataaattaca 


6240 


taaataaaac 


gaaacaaaaa 


taaaaaaaga 


gatgtgacat 


tttgcggcaa 


aaaatgtctc 


6300 


ggcacgataa 


aatttagtta 


aatgggaaaa 


ggcgtgcgcc 


tttaaatatt 


actgtagttt 


6360 


aaaaatcgcg 


ttactgtcga 


attgttgttt 


gccccttttt 


tttttgataa 


aaacatgttt 


6420 


attagtttag 


aaaaaagata 


aataaaccaa 


actacaacag 


tctttatagg 


cgcacgtatt 


6480 


ttcacattta 


aaaatctgtc 


ctttaacgaa 


aaaattgtaa 


aatttggcgc 


cttcaaagag 


6540 


tactgtaatt 


tcaaactcaa 


tttgaaacag 


aattttcatc 


gattttcctt 


agttagtttt 


6600 


tcgatgaatt 


ttaatttatt 


cattaaaaaa 


actcaaataa 


gtataacgat 


attttagcaa 


6660 


ataatatatt 


ttcaaacaaa 


acatgtttct 


ataatttttg 


tctaacccaa 


aatttaggaa 


6720 


tatgacctca 


attcttcaaa 


aagttagtaa 


aacaggttta 


aaaccccgtt 


ataaatattt 


6780 


ttgcctctga 


aacctatcaa 


attttcagat 


acaatcccgg 


tacacacaca 


tatcgcgaca 


6840 


atcaatgtcg 


ctacgttctc 


gtcactgatt 


acaacgattt 


tgagctcgtt 


gagccagaat 


6900 


tccgtcttcg 


ttggtatcag 


ggagatccga 


ctggtctcaa 


caatcagtat 


attctcaaga 


6960 


tcattggacg 


acctgaatgc 


agcgagaaaa 


cattgagact 


tgaagtgaat 


ctttccacgt 


7020 


gtgaaggtac 


attggaaact 


gcagagatga 


taggcgataa 


acgtcggaaa 


acaactttgt 


7080 


tccagtggaa 


aaaatga 
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<210> 7 
<211> 3624 
<212> DNA 
<213> C- elegans 



<400> 7 
atgtcaacaa 


tcaccagcca 


aaaagggata 


agattattaa 


ctgagagacg 


aggggataat 


60 


tccctcatac 


taactctcac 


tcttcactct 


ctctgctctt 


ctcctcattt 


gtcttctttt 


120 


tttgatattg 


gttgtggttt 


tttgtcaccg 


aataataaga 


atgctatgaa 


tacatctcac 


180 


aattcatttt 


tctttttctt 


gcttctcttc 


cttttttcgt 


tctttttgcc 


gtttgccatt 


240 


caactttttg 


gtaaattgcc 


aaattctaag 


aaaatgtggg 


ctttcccagc 


aattttgagc 


300 


ataaatgtaa 


atctaatttc 


tagaaagttg 


atggtcacag 


tgataccaaa 


aataataagt 


360 


tctccatatc 


ctcggacacg 


cctaccactg 


tacctctaca 


ctgtttccat 


cattatttcc 


420 


tgctctttat 


tatactggaa 


tcttctttac 


tgcaaaaatt 


atgactgtgt 


cgttgagaag 


480 


gaatttcgat 


ggggaagtac 


tcggcactta 


ctacagtact 


ttccggtgat 


agcagctccg 


540 


attataatgg 


taatatcgtt 


ttcttggtta 


ataattgcaa 


tatattattc 


aagtagttca 


600 


tgtgttctta 


cattcaattt 


tatggaaatg 


ccatctgcag 


tactttgttc 


tctacttggt 


660 


ggtattagtt 


ctgtaataga 


aattcatttt 


tccattgaag 


taaatcaagt 


tcaatggact 


720 


gatcagtggt 


tactgtcatc 


tgtgggttta 


ccaatcaacg 


attgtttaaa 


aatcgatatt 


780 


ttcagggatc 


ttcaatactt 


ttatgccttt 


tacatgctac 


aattgcgttc 


acacttcaat 


840 


aatccttcca 


acatatttga 


atttccaatc 


ttcttcaaat 


cgatgaatca 


aaaatattat 


900 


gtgaactgtg 


atatttactc 


ttgctcaatt 


catttcatga 


aaaagcaaaa 


gaaaatgagc 


* 960 


ttttcacaag 


cacaagacgt 


atatcttcgt 


ctgaagcaag 


aaaaagaaga 


ggagaaacaa 


1020 


cgagagcgag 


ccgaacgaga 


aaagcgaaat 


gagacgattg 


cagcgacaaa 


taaatcaaga 


1080 


aagaagatga 


atcaggcatt 


ggcaaaaaga 


aataaaaaag 


gacaaccaaa 


tctgaatgct 


1140 


caaatggata 


tggcttccga 


tgaaaatatc 


ggtgccgacg 


gtgaacagaa 


gccttctcgg 


1200 


ccgtttttga 


gaaaaggaca 


aggaacagca 


agatttagaa 


tggtagtttg 


tgcaaataca 


1260 


aggcttatcg 


aaataatata 


tgaagttcag 


cctagaaaca 


acaaaacatc 


tgctggtgca 1320 
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CE61773US.ST25 
atctgcttca agtccttcta 


ttaatgttcc 


taggtttagt 


1380 


ctgtcgaatg 


ctctcccgaa 


ctctgcccga 


accgtggaca 


gtggaatatc 


aaatgaagac 


1440 


gagacccgtc 


caccaaccac 


cgcatctctt 


ccaatggatc 


aaccatcatt 


gtcatcttcg 


1500 


ccggaaaatc 


gtctaaatcc 


cgcaccttcc 


gttgctgaag 


agcatggcca 


cagtggacag 


1560 


cacgctgaag 


aagaagaaga 


caatgacacg 


gatgaagtat 


ctgcaatgcc 


ttcttttgtg 


1620 


cctgatgaac 


cttcgactct 


tgttaattca 


gatcatgaat 


tgtctgatga 


tgctttaaag 


1680 


tataaaaatg 


cagctgccga 


alilicaaaQCti 


tttgagagaa 


gaatggattc 


gatgagatca 


1740 


gcttcaacaa 


tcacaacatc 


a ct crcrca a c cr 


ccali cat cbli 


gtgcaccatc 


aaactcctct 


1800 


gagcctccta 


ctcggtctac 




aacaaliti'taQ 


gcgttggccc 


aaataatcac 


1860 


aattggccgt 


cttcaatgca 




acraatttctc 


tggaaacacc 


acaggctcga 


1920 


ccgcttggca 


gcaatagaat 


1" ;5 1" fa Cf t 


crttccraacrtcr 


aggctcaaac 


gggaataagc 


198C 


cttttacaac 


accatgaaag 




accaccccat 


tgagacgaaa 


tgatatgatg 


204C 


aactcatcac 


gacagaatcc 


acacraaliacra 


aatiaiitcaao 


atgaaaatcg 


acccgagcac 


210C 


gtttatgatc 


aaccaataca 


tcfttcctacra 


tcatcactigg 


accgacagaa 


acttgaaatt 


216C 


gaaattcgac 


gtcatcgtaa 


cttcraacata 


caactiQaaaG 


acactattgc 


tcacttggat 


222C 


tatgcagaag 


aatccgtgca 


caccacaaaa 


caacaactcQ 


aagaaaaaat 


ttccgaagtc 


228C 


aataatttta 


agaaagaact 


aaliaoaaaaa 


tttaagaaa't 


gcaaaaaagg 


agttgaggaa 


234C 


gaatttgaga 


agaagtttga 


gaaaattaag 


gaagattatg 


atgaacttta 


cgagaaattg 


240C 


aagagggatc 


aacgagatct 


tgaacgagat 


cagaagatat 


tgaagaaagg 


aacgggagaa 


246C 


aggaataaag 


aattcacaga 


aacga'tagcc 


actctccgcg 


acaaattaag 


agcatcagaa 


252C 


accaagaatg 


cacaatatcg 


acaggatata 


cgtgttcgag 


acgaaaagct 


caagaaaaaa 


258C 


gacgaggaaa 


tcgagaagct 


tcagaaagac 


ggaaaccggc 


taaagagcac 


tctacagact 


264( 


ttagaaaagc 


gcgtaaaaca 


attacgtact 


gaaaaagaac 


gcgacgataa 


agaaaaggag 


270( 


atgttcgcga 


aggttgcaat 


gaatcgaaaa 


acttcgaatc 


cagtgccacc 


agttttgaat 


276( 


caaagtgttc 


caatttcgat 


aacatcaaat 


ggtccatcta 


gacatccatc 


atcatcttcg 


282( 


ttgacaacat 


ttagaaaacc 


atctacatca 


aatcgagaaa 


gaggtgttag 


ttgggcagat 


288( 
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gaaccaaatg aacaatcatt ggaagctgta ccacaggagt ttttgatgat gccagtcaaa 294( 
gaaatgccgg gaaaatttgg aaaatgcacg atctacagag attctcttgg agaaacatct 300( 
aaagtgacgg atacaatagc taacggtctt cttttcgaat attccaatgg agatcttcga 306( 
tgggttaatc ggcagaacgc tgttaatatc tacatatccg cagttgataa aacagtcaga 312( 
attgatctcc ccacatacaa tatttcaatt attcatacat ttcaaaggca agttgaagta 318( 
cttcgtcctg gaaataacat aacattgata agtattaaac gacgagaagt tcgaactgat 324 C 
ttgatttatc aaaacggaat gtataaaact gaaatcttca atagggacgg aagatatgtt 330t 
acgaaggatt ttagcaatca agaagtttcg agaaaataca atcccggtac acacacatat 336t 
cgcgacaatc aatgtcgcta cgttctcgtc actgattaca acgattttga gctcgttgag 342i 
ccagaattcc gtcttcgttg gtatcaggga gatccgactg gtctcaacaa tcagtatatt 348i 
ctcaagatca ttggacgacc tgaatgcagc gagaaaacat tgagacttga agtgaatctt 354< 
tccacgtgtg aaggtacatt ggaaactgca gagatgatag gcgataaacg tcggaaaaca 360« 
actttgttcc agtggaaaaa atga 362* 

<210> 8 

<211> 336 

<212> PRT 

<213> C. elegans 

<400> 8 

Met Asn Arg Leu Lys Ser Asp Gin Lys Thr Lys Leu Arg Gin Phe Val 
15 10 15 



Gin Trp Thr Gin Val Thr Glu Ala Val Ser Leu Asn Phe Leu Ala Lys 
20 25 30 



Ala Asn Trp Asn lie Glu Tyr Ala Met Thr Leu Tyr Phe Asp Asn Pro 
35 40 45 



Asn Leu Phe Ala Gly Ser Thr Pro Gin Pro Ser Val Asp Arg Ser Asn 
50 55 60 



lie Glu Arg Leu Phe Asn Gin Tyr Val Asp Pro Lys Asp Lys Val Gly 
65 70 75 80 
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Glu Lys Arg Met Gly Pro His Gly lie Asn Arg Leu Leu Thr Asp Leu 
85 90 95 



Gly Tyr Glu Ala Thr Asp Arg Arg Val Leu Val Leu Ala Trp Lys Phe 
100 105 110 



Thr Ala Gin Thr Gin Cys Glu Phe Ser Leu Asp Glu Trp Val Lys Gly 
115 120 125 



Met Thr Ala Leu Gin Ala Asp Thr Val Gin Asn Leu Arg Gin Arg lie 
130 135 140 



Asp Ser lie Asn Ser Gly Leu Glu Ser Asp Lys Ala Lys Phe His Glu 
145 150 155 160 



Leu Tyr Leu Phe Ala Phe Asn Tyr Ala Lys Ser Ala Ala Cys Arg Asn 
165 170 175 



Leu Asp Leu Glu Thr Ala lie Cys Cys Trp Asp Val Leu Phe Gly Gin 
180 185 190 



Arg Ser Thr lie Met Thr Gin Trp lie Asp Phe Leu Trp Ala Gin Glu 
195 200 205 



Asn Ala Ala Ala Ser Arg Leu Ala Gin Asn Val Gly Ala Ser Asn Ala 
210 215 220 



Lys Gin Phe Lys Ser Val Trp lie Ser Arg Asp Thr Trp Asn Leu Phe 
225 230 235 240 



Trp Asp Phe lie Leu Leu Ser Lys Pro Asp Leu Ser Asp Tyr Asp Asp 
245 250 255 



Glu Gly Ala Trp Pro Val Leu lie Asp Gin Phe Val Asp Tyr Cys Arg 
260 265 270 



Glu Asn Leu Asn Tyr Pro Lys Pro Gly Asn Ala Ser Asn Asp Gin Gin 
275 280 285 
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Met Glu Thr Pro Lys lie Ala Gin Lys Lys Pro Gly lie Phe Tyr Phe 
290 295 300 



Asn Ser Asn Leu Gin Leu lie Glu Phe Lys Leu Phe Gin Tyr Pro Met 
305 310 315 320 



Leu Lys Thr lie Phe Lys lie Thr lie His Thr Ala Gly Thr Asn Arg 
325 330 335 



<210> 9 

<211> 283 

<212> PRT 

<213> C. elegans 

<400> 9 

Met Asn Arg Leu Lys Ser Asp Gin Lys Thr Lys lie Glu Arg Leu Phe 
15 10 15 



Asn Gin Tyr Val Asp Pro Lys Asp Lys Val Gly Glu Lys Arg Met Gly 
20 25 30 



Pro His Gly lie Asn Arg Leu Leu Thr Asp Leu Gly Tyr Glu Ala Thr 
35 40 45 



Asp Arg Arg Val Leu Val Leu Ala Trp Lys Phe Thr Ala Gin Thr Gin 
50 55 60 



Cys Glu Phe Ser Leu Asp Glu Trp Val Lys Gly Met Thr Ala Leu Gin 
65 70 75 80 



Ala Asp Thr Val Gin Asn Leu Arg Gin Arg lie Asp Ser lie Asn Ser 
85 90 95 



Gly Leu Glu Ser Asp Lys Ala Lys Phe His Glu Leu Tyr Leu Phe Ala 
100 105 110 



Phe Asn Tyr Ala Lys Ser Ala Ala Cys Arg Asn Leu Asp Leu Glu Thr 
115 120 125 
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Ala He Cys Cys Trp Asp Val Leu Phe Gly Gin Arg Ser Thr He Met 
130 135 140 



Thr Gin Trp He Asp Phe Leu Trp Ala Gin Glu Asn Ala Ala Ala Ser 
145 150 155 160 



Arg Leu Ala Gin Asn Val Gly Ala Ser Asn Ala Lys Gin Phe Lys Ser 
165 170 175 



Val Trp He Ser Arg Asp Thr Trp Asn Leu Phe Trp Asp Phe He Leu 
180 185 190 



Leu Ser Lys Pro Asp Leu Ser Asp Tyr Asp Asp Glu Gly Ala Trp Pro 
195 200 205 



Val Leu He Asp Gin Phe Val Asp Tyr Cys Arg Glu Asn Leu Asn Tyr 
210 215 220 



Pro Lys Pro Gly Asn Ala Ser Asn Asp Gin Gin Met Glu Thr Pro Lys 
225 230 235 240 



He Ala Gin Lys Lys Pro Gly He Phe Tyr Phe Asn Ser Asn Leu Gin 
245 250 255 



Leu He Glu Phe Lys Leu Phe Gin Tyr Pro Met Leu Lys Thr He Phe 
260 265 270 



Lys He Thr He His Thr Ala Gly Thr Asn Arg 
275 280 



<210> 10 

<211> 1010 

<212> PRT 

<213> C. elegans 

<400> 10 

Met Ser Met Glu Pro Arg Lys Lys Arg Asn Ser He Leu Lys Val Arg 
15 10 15 
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Gin Ala Val Glu Thr He Glu Glu Thr Val Met Asn Ser Gly Pro Ser 

20 25 30 



Ser Thr Thr Thr Asn Arg Arg Val Ser Phe His Asn Val Lys His Val 
35 40 45 



Lys Gin Tyr Asp Arg Asp His Gly Lys He Leu Asp Ala Thr Pro Val 
50 55 60 



Lys Glu Lys He Thr Asp Thr He Gly Ser Asp Gly He Leu Thr Pro 
65 70 75 80 



Arg Gly Gly Asn Met Asp He Ser Glu Ser Pro Ala Cys Thr Ser Ser 
85 90 95 



Phe Gin Val Phe Gly Gly Gly Asn Leu Asp Lys Thr Met Asp Met Ser 
100 105 110 



Leu Glu Thr Thr He Asn Glu Asn Asn Glu Thr Ala Arg Leu Phe Glu 
115 120 125 



Thr Thr Arg Asp Pro Thr Leu Leu Tyr Glu Lys He Val Glu Thr Thr 
130 135 140 



Thr Lys Val Thr Glu Arg He Val Ser Met Pro Leu Asp Asp Thr Leu 
145 150 155 160 



Ala Met Phe Asn Thr Thr Asn Gin Glu Asp Lys Asp Met Ser Val Asp 
165 170 175 



Arg Ser Val Leu Phe Thr He Pro Lys Val Pro Lys His Asn Ala Thr 
180 185 190 



Met Asn Arg Thr He Pro Met Asp Leu Asp Glu Ser Lys Ala Ala Gly 
195 200 205 



Gly Gin Cys Asp Glu Thr Met Asn Val Phe Asn Phe Thr Asn Leu Glu 
210 215 • 220 
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Ala Ala Glu Met Asp Thr Ser Lys Leu Asp Glu Asn Asn Thr Met Asn 
225 230 235 240 



Ala He Arg He Pro He Asn Ser Asn Val Met Pro Val Asp Met Asp 
245 250 255 



He Thr Glu His His Thr Leu He Glu Glu Lys Lys Asn Asp Thr Phe 
260 265 270 



Gly Pro Ser Gin Leu Met Asp He Ser Ala Pro Gin Val Gin Val Asn 
275 280 285 



Asp Thr Leu Ala He Phe Asn Ser Pro Arg Asp He Cys Asn Lys Gly 
290 295 300 



Leu Gly Val Pro Gin Asn Leu He Asn He Ala Ser Asn Val Val Pro 
305 310 315 320 



Val Asp Met Asp He Thr Asp Gin Ala Val Leu Asn Ala Glu Lys Lys 
325 .330 335 



Asn Asp Gin Phe Glu Thr Ser Gin Leu Met Asp He Ser He Pro Lys 
340 345 350 



Val Leu Val Asn Asp Thr Met Ala Met Phe Asn Ser Pro Lys His Val 
355 360 365 



Ser Lys Ser Ser Met Asp Leu Glu Lys Thr He Glu Ala Ala Asp Lys 
370 375 380 



Ser Thr Lys Tyr Pro Ser He Ala Asp Glu Val Glu Asp Leu Asp Met 
385 390 395 400 



Asp Met Asp He Thr Glu Gin Gin Pro Cys Glu Ala Gly Asn Gin Gin 
405 410 415 



Asn Asp Gly Leu Gin Leu Gin Lys Glu Asp Leu Met Asp He Ser Val 
420 425 430 
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He Arg Asp Ser Pro Ala Val Asn Asp Thr Met Ala Val Phe Gin Ser 
435 440 445 



Pro Ala Arg Val Lys He Gly Ala Asn Asn Ser He He Asp Ser Gin 
450 455 460 



Lys Ser He Val Phe Gly Asp Glu Met Ser He Asp Glu Thr Gin Asn 
465 470 475 480 



Asp Gly Thr Leu Thr Leu Pro Lys Ser Asn Val Glu Val Thr Thr Thr 
485 490 495 



Asn Asp Val Tyr Thr Ser Leu Glu Arg Gin Glu Glu Asn Ala Ser Glu 
500 505 510 



Asn Val Ser Met He Asn Glu Ser Ser Val His Ser Glu He Asp Lys 
515 520 525 



Lys Ser Phe Met Leu He Glu Glu Glu Arg Ala Phe Met His Ser Ser 
530 535 540 



Met He Asp Val Ala Gin Lys Leu Glu Asp Asp Gly Ser Ser Lys Thr 
545 550 555 560 



Pro Val He Leu Ala Ser Gin Ser Ala Ser Leu Ala Thr Lys Glu Pro 
565 570 575 



Ser Ala Leu His Asn Ser Ser Ala Thr Leu Asn Asn Ser Met Glu Leu 
580 585 590 



Asp Asn Asn Thr Leu Leu Lys Thr Met Gin He Thr Thr Cys Glu Asp 
595 600 605 



He Ser Met Val His Glu Ser He Ala Val Glu Leu Asn Ser Asn Lys 
610 615 620 



Glu Gin Glu Gin Phe Gly Asp Glu Thr Leu Gin Lys Asn Asp Thr Ser 
625 630 635 640 
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Asn Thr Gly Ala Asn Phe Thr Phe Gin Gly His Asn Glu Thr Ser Gin 
645 650 655 



He Met Asn Asn Val Asp Ser Glu Ala Val Asn Thr Ser Lys He Ser 
660 665 670 



Thr Tyr Ser Ala Phe Asn Leu Ser He Asn Gin Ser He Ser Lys Arg 
675 680 685 



Arg Arg Ser Leu Leu Asn Ser Ala Arg Glu Ser Pro Arg Arg Val Ala 
690 695 700 



Leu Glu Asn Ser He Met Ser Met Asn Gly Gin Thr Met Glu Ala Leu 
705 710 715 720 



Thr Glu Tyr Arg Gin Asn Lys Thr Met Gin Thr Ser Gin Asp Ser Met 
725 730 735 



Pro Ser Met Ser Leu Asn Asp Ser Gly Arg Asp He Leu Ala Met Asn 
740 745 750 



Thr Ser Val Arg Ser Pro His Leu Asn Ser Ser Lys Thr Ala Ala Pro 
755 760 765 



Gly Thr Pro Ser Leu Met Ser Gin Asn Val Gin Leu Pro Pro Pro Ser 
770 775 780 



Pro Gin Phe Glu Met Pro Asp Phe Asp Pro Ala Val Val Asn Val Val 
785 790 795 800 



Tyr Leu Thr Ser Glu Asp Pro Ser Thr Glu Gin His Pro Glu Ala Leu 
805 810 815 



Lys Phe Gin Arg He Val Glu Asn Glu Lys Met Lys Val Gin His Glu 
820 825 830 



He Asp Ser Leu Asn Ser Thr Asn Gin Leu Ser Ala Glu Lys He Asp 
835 840 845 
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Met Leu Lys Thr Lys Glu Leu Leu Lys Phe Ser His Asp Glu Arg Glu 
850 855 860 



Ala lie Met lie Ala Arg Lys Asp Ala Glu lie Lys Phe Leu Glu Leu 
865 870 875 880 



Arg Leu Lys Phe Ala Leu Glu Lys Lys He Glu Ser Asp Gin Glu He 
885 890 895 



Ala Glu Leu Glu Gin Gly Asn Ser Lys Met Ala Glu Gin Leu Arg Gly 
900 905 910 



Leu Asp Lys Met Ala Val Val Gin Lys Glu Leu Glu Lys Leu Arg Ser 
915 920 925 



Leu Pro Pro Ser Arg Glu Glu Ser Gly Lys He Arg Lys Glu Trp Met 
930 935 940 



Glu Met Lys Gin Trp Glu Phe Asp Gin Lys Met Lys Ala Leu Arg Asn 
945 950 955 960 



Val Arg Ser Asn Met He Ala Leu Arg Ser Glu Lys Asn Ala Leu Glu 
965 970 975 



Met Lys Val Ala Glu Glu His Glu Lys Phe Ala Gin Arg Asn Asp Leu 
980 985 990 



Lys Lys Ser Arg Met Leu Val Phe Ser Lys Ala Val Lys Lys He Val 
995 1000 1005 



Asn Phe 
1010 



<210> 11 

<211> 1207 

<212> PRT 

<213> C. elegans 

<400> 11 



Met Ser Thr He Thr Ser Gin Lys Gly He Arg Leu Leu Thr Glu Arg 
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10 15 



Arg Gly Asp Asn Ser Leu lie Leu Thr Leu Thr Leu His Ser Leu Cys 
20 25 30 



Ser Ser Pro His Leu Ser Ser Phe Phe Asp lie Gly Cys Gly Phe Leu 
35 40 45 



Ser Pro Asn Asn Lys Asn Ala Met Asn Thr Ser His Asn Ser Phe Phe 
50 55 60 



Phe Phe Leu Leu Leu Phe Leu Phe Ser Phe Phe Leu Pro Phe Ala lie 
65 70 75 80 



Gin Leu Phe Gly Lys Leu Pro Asn Ser Lys Lys Met Trp Ala Phe Pro 
85 90 95 



Ala lie Leu Ser lie Asn Val Asn Leu lie Ser Arg Lys Leu Met Val 
100 105 110 



Thr Val lie Pro Lys lie lie Ser Ser Pro Tyr Pro Arg Thr Arg Leu 
115 120 125 



Pro Leu Tyr Leu Tyr Thr Val Ser lie lie lie Ser Cys Ser Leu Leu 
130 135 140 



Tyr Trp Asn Leu Leu Tyr Cys Lys Asn Tyr Asp Cys Val Val Glu Lys 
145 150 155 160 



Glu Phe Arg Trp Gly Ser Thr Arg His Leu Leu Gin Tyr Phe Pro Val 
165 170 175 



lie Ala Ala Pro lie lie Met Val lie Ser Phe Ser Trp Leu lie lie 
180 185 190 



Ala lie Tyr Tyr Ser Ser Ser Ser Cys Val Leu Thr Phe Asn Phe Met 
195 200 205 



Glu Met Pro Ser Ala Val Leu Cys Ser Leu Leu Gly Gly lie Ser Ser 
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210 215 220 



Val lie Glu He His Phe Ser He Glu Val Asn Gin Val Gin Trp Thr 
225 230 235 240 



Asp Gin Trp Leu Leu Ser Ser Val Gly Leu Pro He Asn Asp Cys Leu 
245 250 255 



Lys He Asp He Phe Arg Asp Leu Gin Tyr Phe Tyr Ala Phe Tyr Met 
260 265 270 



Leu Gin Leu Arg Ser His Phe Asn Asn Pro Ser Asn He Phe Glu Phe 
275 280 285 



Pro He Phe Phe Lys Ser Met Asn Gin Lys Tyr Tyr Val Asn Cys Asp 
290 295 300 



He Tyr Ser Cys Ser He His Phe Met Lys Lys Gin Lys Lys Met Ser 
305 310 315 320 



Phe Ser Gin Ala Gla Asp Val Tyr Leu Arg Leu Lys Gin Glu Lys Glu 
325 330 335 



Glu Glu Lys Gin Arg Glu Arg Ala Glu Arg Glu Lys Arg Asn Glu Thr 
340 345 350 



He Ala Ala Thr Asn Lys Ser Arg Lys Lys Met Asn Gin Ala Leu Ala 
355 360 365 



Lys Arg Asn Lys Lys Gly Gin Pro Asn Leu Asn Ala Gin Met Asp Met 
370 375 380 



Ala Ser Asp Glu Asn He Gly Ala Asp Gly Glu Gin Lys Pro Ser Arg 
385 390 395 400 



Pro Phe Leu Arg Lys Gly Gin Gly Thr Ala Arg Phe Arg Met Val Val 
405 410 415 



Cys Ala Asn Thr Arg Leu He Glu He He Tyr Glu Val Gin Pro Arg 
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420 425 430 



Asn Asn Lys Thr Ser Ala Gly Ala Pro Pro Thr Ser Glu Leu Ser Ser 
435 440 445 



Ala Ser Ser Pro Ser lie Asn Val Pro Arg Phe Ser Leu Ser Asn Ala 
450 455 460 



Leu Pro Asn Ser Ala Arg Thr Val Asp Ser Gly lie Ser Asn Glu Asp 
465 470 475 480 



Glu Thr Arg Pro Pro Thr Thr Ala Ser Leu Pro Met Asp Gin Pro Ser 
485 490 495 



Leu Ser Ser Ser Pro Glu Asn Arg Leu Asn Pro Ala Pro Ser Val Ala 

500 505 510 



Glu Glu His Gly His Ser Gly Gin His Ala Glu Glu Glu Glu Asp Asn 
515 520 525 



Asp Thr Asp Glu Val Ser Ala Met Pro Ser Phe Val Pro Asp Glu Pro 
530 535 540 



Ser Thr Leu Val Asn Ser Asp His Glu Leu Ser Asp Asp Ala Leu Lys 
545 550 555 560 



Tyr Lys Asn Ala Ala Ala Glu Phe Lys Ala Phe Glu Arg Arg Met Asp 
565 570 575 



Ser Met Arg Ser Ala Ser Thr lie Thr Thr Ser Leu Ala Thr Pro Ser 
580 585 590 



Ser Cys Ala Pro Ser Asn Ser Ser Glu Pro Pro Thr Arg Ser Thr Pro 
595 600 605 



lie Met Asn Asp Leu Gly Val Gly Pro Asn Asn His Asn Trp Pro Ser 
610 615 620 



Ser Met Gin Glu Leu Ser Gly lie Ser Leu Glu Thr Pro Gin Ala Arg 
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625 630 635 640 



Pro Leu Gly Ser Asn Arg lie Asn Gin Leu Val Arg Ser Glu Ala Gin 
645 650 655 



Thr Gly lie Ser Leu Leu Gin His His Glu Arg Pro Thr Val Thr Ala 
660 665 670 



Pro Leu Arg Arg Asn Asp Met Met Asn Ser Ser Arg Gin Asn Pro Gin 
675 680 685 



Asn Gly Asn Val Gin Asp Glu Asn Arg Pro Glu His Val Tyr Asp Gin 
690 695 700 



Pro He His Val Pro Gly Ser Ser Leu Asp Arg Gin Lys Leu Glu He 
705 710 715 720 



Glu He Arg Arg His Arg Asn Leu Asn He Gin Leu Arg Asp Thr He 
725 730 735 



Ala His Leu Asp Tyr Ala Glu Glu Ser Val His Thr Thr Lys Arg Gin 
740 745 750 



Leu Glu Glu Lys He Ser Glu Val Asn Asn Phe Lys Lys Glu Leu He 
755 760 765 



Glu Glu Phe Lys Lys Cys Lys Lys Gly Val Glu Glu Glu Phe Glu Lys 
770 775 780 



Lys Phe Glu Lys He Lys Glu Asp Tyr Asp Glu Leu Tyr Glu Lys Leu 
785 790 795 800 



Lys Arg Asp Gin Arg Asp Leu Glu Arg Asp Gin Lys He Leu Lys Lys 
805 810 815 



Gly Thr Gly Glu Arg Asn Lys Glu Phe Thr Glu Thr He Ala Thr Leu 
820 825 830 



Arg Asp Lys Leu Arg Ala Ser Glu Thr Lys Asn Ala Gin Tyr Arg Gin 
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835 840 845 



Asp He Arg Val Arg Asp Glu Lys Leu Lys Lys Lys Asp Glu Glu He 
850 855 860 



Glu Lys Leu Gin Lys Asp Gly Asn Arg Leu Lys Ser Thr Leu Gin Thr 
865 870 875 880 



Leu Glu Lys Arg Val Lys Gin Leu Arg Thr Glu Lys Glu Arg Asp Asp 
885 890 895 



Lys Glu Lys Glu Met Phe Ala Lys Val Ala Met Asn Arg Lys Thr Ser 
900 905 910 



Asn Pro Val Pro Pro Val Leu Asn Gin Ser Val Pro He Ser He Thr 
915 920 925 



Ser Asn Gly Pro Ser Arg His Pro Ser Ser Ser Ser Leu Thr Thr Phe 
930 935 940 



Arg Lys Pro Ser Thr Ser Asn Arg Glu Arg Gly Val Ser Trp Ala Asp 
945 950 955 960 



Glu Pro Asn Glu Gin Ser Leu Glu Ala Val Pro Gin Glu Phe Leu Met 
965 970 975 



Met Pro Val Lys Glu Met Pro Gly Lys Phe Gly Lys Cys Thr He Tyr 
980 985 990 



Arg Asp Ser Leu Gly Glu Thr Ser Lys Val Thr Asp Thr He Ala Asn 
995 1000 1005 



Gly Leu Leu Phe Glu Tyr Ser Asn Gly Asp Leu Arg Trp Val Asn 
1010 1015 1020 



Arg Gin Asn Ala Val Asn He Tyr He Ser Ala Val Asp Lys Thr 
1025 1030 1035 



Val Arg He Asp Leu Pro Thr Tyr Asn He Ser He He His Thr 
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1040 1045 1050 



Phe Gin Arg Gin Val Glu Val Leu Arg Pro Gly Asn Asn He Thr 
1055 1060 1065 



Leu He Ser He Lys Arg Arg Glu Val Arg Thr Asp Leu He Tyr 
1070 1075 1080 



Gin Asn Gly Met Tyr Lys Thr Glu He Phe Asn Arg Asp Gly Arg 
1085 1090 1095 



Tyr Val Thr Lys Asp Phe Ser Asn Gin Glu Val Ser Arg Lys Tyr 
1100 1105 1110 



Asn Pro Gly Thr His Thr Tyr Arg Asp Asn Gin Cys Arg Tyr Val 
1115 1120 1125 



Leu Val Thr Asp Tyr Asn Asp Phe Glu Leu Val Glu Pro Glu Phe 
1130 1135 1140 



Arg Leu Arg Trp Tyr Gin Gly Asp Pro Thr Gly Leu Asn Asn Gin 
1145 1150 1155 



Tyr He Leu Lys He He Gly Arg Pro Glu Cys Ser Glu Lys Thr 
1160 1165 1170 



Leu Arg Leu Glu Val Asn Leu Ser Thr Cys Glu Gly Thr Leu Glu 
1175 1180 1185 



Thr Ala Glu Met He Gly Asp Lys Arg Arg Lys Thr Thr Leu Phe 
1190 1195 1200 



Gin Trp Lys Lys 
1205 



<210> 12 

<211> 780 

<212> DNA 

<213> homo sapiens 
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<400> 12 



atgaacaagt 


tgaaatcatc 


gcagaaggat aaagttcgtc 


agtttatgat 


cttcacacaa 


60 


tctagtgaaa 


aaacagcagt 


aagttgtctt tctcaaaatg 


actggaagtt 


agatgttgca 


120 


acagataatt 


ttttccaaaa 


tcctgaactt tatatacgag 


agagtgtaaa 


aggatcattg 


180 


gacaggaaga 


agttagaaca 


gctgtacaat agatacaaag 


accctcaaga 


tgagaataaa 


24C 


attggaatag 


atggcataca 


gcagttctgt gatgacctgg 


cactcgatcc 


agccagcatt 


30C 


agtgtgttga 


ttattgcgtg 


gaagttcaga gcagcaacac 


agtgcgagtt 


ctccaaacag 


36C 


gagttcatgg 


atggcatgac 


agaattagga tgtgacagca 


tagaacaact 


aaaggcccag 


42C 


atacccaaga 


tggaacaaga 


attgaaagaa. ccaggacgat 


ttaaggattt 


ttaccagttt 


48C 


acttttaatt 


ttgcaaagaa 


tccaggacaa aaaggattag 


atctagaaat 


ggccattgcc 


54C 


tactggaact 


tagtgcttaa 


tggaagattt aaattcttag 


acttatggaa 


taaatttttg 


60( 


ttggaacatc 


ataaacgatc 


aataccaaaa gacacttgga 


atcttctttt 


agacttcagt 


66C 


acgatgattg 


cagatgacat 


gtctaattat gatgaagaag 


gagcatggcc 


tgttcttatt 


72( 


gatgactttg 


tggaatttgc 


acgccctcaa attgctggga 


caaaaagtac 


aacagtgtag 


78( 



<210> 13 

<211> 259 

<212> PRT 

<213> homo sapiens 

<400> 13 

Met Asn Lys Leu Lys Ser Ser Gin Lys Asp Lys Val Arg Gin Phe Met 
15 10 15 

lie Phe Thr Gin Ser Ser Glu Lys Thr Ala Val Ser Cys Leu Ser Gin 
20 25 30 

Asn Asp Trp Lys Leu Asp Val Ala Thr Asp Asn Phe Phe Gin Asn Pro 
35 40 45 

Glu Leu Tyr lie Arg Glu Ser Val Lys Gly Ser Leu Asp Arg Lys Lys 
50 55 60 

Leu Glu Gin Leu Tyr Asn Arg Tyr Lys Asp Pro Gin Asp Glu Asn Lys 
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65 70 75 80 



lie Gly lie Asp Gly lie Gin Gin Phe Cys Asp Asp Leu Ala Leu Asp 
85 90 95 



Pro Ala Ser lie Ser Val Leu lie lie Ala Trp Lys Phe Arg Ala Ala 
100 105 110 



Thr Gin Cys Glu Phe Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu 
115 120 125 



Leu Gly Cys Asp Ser lie Glu Gin Leu Lys Ala Gin lie Pro Lys Met 
130 135 140 



Glu Gin Glu Leu Lys Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe 
145 150 155 160 



Thr Phe Asn Phe Ala Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu 
165 170 175 



Met Ala lie Ala Tyr Trp Asn Leu Val Leu Asn Gly Arg Phe Lys Phe 
180 185 190 



Leu Asp Leu Trp Asn Lys Phe Leu Leu Glu His His Lys Arg Ser lie 
195 200 205 



Pro Lys Asp Thr Trp Asn Leu Leu Leu Asp Phe Ser Thr Met lie Ala 
210 215 220 



Asp Asp Met Ser Asn Tyr Asp Glu Glu Gly Ala Trp Pro Val Leu lie 
225 230 235 240 



Asp Asp Phe Val Glu Phe Ala Arg Pro Gin He Ala Gly Thr Lys Ser 
245 250 255 



Thr Thr Val 



<210> 14 
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<211> 258 
<212> PRT 
<213> Homo sapiens 

<400> 14 

Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu Leu Gly Cys Asp Ser 
15 10 15 



He Glu Gin Leu Lys Ala Gin He Pro Lys Met Glu Gin Glu Leu Lys 
20 25 30 



Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe Thr Phe Asn Phe Ala 
35 40 45 



Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu Asp Arg Lys Lys Leu 
50 55 60 



Glu Gin Leu Tyr Asn Arg Tyr Lys Asp Pro Gin Asp Glu Asn Lys He 
65 70 75 80 



Gly He Asp Gly He Gin Gin Phe Cys Asp Asp Leu Ala Leu Asp Pro 
85 90 95 



Ala Ser He Ser Val Leu He He Ala Trp Lys Phe Arg Ala Ala Thr 
100 105 110 



Gin Cys Glu Phe Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu Leu 
115 120 125 



Gly Cys Asp Ser He Glu Gin Leu Lys Ala Gin He Pro Lys Met Glu 
130 135 140 



Gin Glu Leu Lys Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe Thr 
145 150 155 160 



Phe Asn Phe Ala Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu Met 
165 170 175 



Ala He Ala Tyr Trp Asn Leu Val Leu Asn Gly Arg Phe Lys Phe Leu 
180 185 190 
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Asp Leu Trp Asn Lys Phe Leu Leu Glu His His Lys Arg Ser lie Pro 
195 200 205 



Lys Asp Thr Trp Asn Leu Leu Leu Asp Phe Ser Thr Met lie Ala Asp 
210 215 220 



Asp Met Ser Asn Tyr Asp Glu Glu Gly Ala Trp Pro Val Leu He Asp 
225 230 235 240 



Asp Phe Val Glu Phe Ala Arg Pro Gin He Ala Gly Thr Lys Ser Thr 
245 • 250 255 



Thr Val 



<210> 15 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> T7 polymerase promoter sequence (example 1) 

<400> 15 

taatacgact cactatagg 



<210> 16 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> T3 polymerase promoter sequence 

<400> 16 

aattaaccct cactaaagg 



<210> 17 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 
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<223> oligonucleotide for PGR amplification (example 3) 
<400> 17 

tcaatcagta tgtcgaccc 



<210> 18 

<211> 19 

<212> DNA 

<213> artificial sequence 



<220> 

<223> oligonucleotide for PCR amplification (example 3) 
<400> 18 

ggaagaaatt ggggaaaca 



<210> 19 

<211> 19 

<212> DNA 

<213> artificial sequence 



<220> 

<223> oligonucleotide for PCR amplification. (example 3) 
<400> 19 

atcgagcgcc tcttcaatc 



<210> 20 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 3) 

<400> 20 

tggtgtctcc atttgctga 



<210> 21 
<211> 19 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 4) 
<40G> 21 

atctgaagat ccgtccact 
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<210> 22 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (exan?)le 4) 

<400> 22 

atgcacaatg ggtattttt 



<210> 23 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; forward 
primer to generate dsRNA 305A12) 

<400> 23 

ttcgtctcga acacgtatat cct 



<210> 24 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; reverse 
primer to generate dsRNA 305A12) 

<400> 24 

gaaagaagat gaatcaggca ttg 



<210> 25 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; forward 
primer to generate dsRNA 341G5) 

<400> 25 

ctgcaaaaat tatgactgtg teg 
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<210> 26 
<211> 21 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PGR amplification (example 5; reverse 
primer to generate dsRNA 341G5) 

<400> 26 

agcattcaga tttggttgtc c 
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